Scale agentic AI from on-device to cloud orchestration | BRKSP92

Karthik Vijayan, Colin Helms, imran Sheik Mohamed, and Jayneel Vora present a Microsoft Build 2026 breakout on designing agentic AI systems that run across client devices, edge environments, and the cloud.

Overview

Modern AI systems often span multiple environments rather than running as a single model in one place. This session explores how agentic AI workloads operate across client, edge, and cloud through three demos:

The session also focuses on practical guidance for deciding where to place inference, reasoning, and orchestration to balance responsiveness, scale, and efficiency.

Session chapters (from the video)

Key themes

Placing AI capabilities across environments

Orchestrating multi-agent systems on AKS

Performance and hardware considerations

Demo workflow elements