Move AI workflows from test to production on Microsoft Foundry | DEMSP383

Vignesh Sridhar demonstrates how to run high-performance LLM inference on Microsoft Foundry using Fireworks AI, and how to take an AI workflow from testing into production with a unified deployment and evaluation flow focused on latency, cost, and quality metrics.

Overview

This Microsoft Build 2026 demo shows an end-to-end workflow for moving an enterprise AI use case from test to production by running high-performance inference directly on Microsoft Foundry, using Fireworks AI integration.

Key themes covered in the session description and chapter outline:

Session context