Turn foundation models into production AI on Microsoft Foundry | BRKSP91

Vivek Chauhan explains how to move from generic foundation models to production-ready, use case-specific AI by combining Fireworks AI training/inference capabilities with Microsoft (Azure) AI Foundry, focusing on practical patterns to reduce cost and latency and deploy at scale.

Overview

This Microsoft Build 2026 breakout covers how teams can operationalize foundation models by:

Session segments (from the published chapters)

Fireworks' inference engine and LLM serving optimization

Flexible training options for teams at different stages

PTU mode for production workloads

Live demo: model catalog on Azure AI Foundry

Integrating deployed models into Azure agents

Case study discussion: open-weight models and post-training

Partnership value and closing guidance