Profile and optimize agentic AI on Windows | DEMSP384

Freddy Chiu demonstrates how to profile and tune agentic AI applications on Intel-powered Windows PCs, focusing on end-to-end performance across CPU, GPU, and NPU. The session shows how to collect telemetry, identify bottlenecks, and apply practical optimization techniques to improve responsiveness and power efficiency.

Overview

This Microsoft Build 2026 demo session covers profiling and optimizing agentic AI apps running on Windows, with emphasis on measuring performance across the full hardware/software stack and using telemetry to find and fix bottlenecks.

What the session covers

Profiling goals for agentic AI apps

Telemetry types

Intel tracing integration (ITT)

System-level hardware/software interaction

Tool invocation and optimization workflow

Platform telemetry and hardware-level metrics

Custom instrumentation and task creation

Demo: model and compilation-time analysis

Tools and technologies mentioned

Session metadata