Create multimodal AI agents with persistent memory | DEMSP390

Edo Segal demonstrates how to build multimodal AI agents with persistent memory, including a live walkthrough of provisioning Napster as an Azure resource and integrating the agent securely with Azure AI Foundry.

Overview

This Microsoft Build 2026 session focuses on building a working “video AI agent” that can operate across multiple user touchpoints (web, app, store, support) while retaining context over time via persistent memory.

The key themes, drawn from the session description and chapter outline, are listed below.

Session structure (from chapters)

Starting point and motivation

Building multimodal agents via API

Provisioning and setup on Azure

MCP Server embedded in JavaScript

Secure integration with Azure AI Foundry

Demo example

What you should take away