Hugging Face open‑source models to production on Microsoft Foundry | DEM320

Vaidyaraman Sambasivam, Osi Otugo, and Jean Boudier demonstrate an end-to-end flow for taking Hugging Face open-source models from discovery to production inference using Foundry Managed Compute in Azure AI Foundry, focusing on scaling, governance, and avoiding direct GPU management.

Overview

This lightning talk (DEM320, Microsoft Build 2026) focuses on operationalizing open-source models in production by deploying and scaling Hugging Face models on Azure using Foundry Managed Compute inside Azure AI Foundry.

What the session covers

Deploying Hugging Face models on Azure via Foundry

Production concerns addressed

Ownership and control with self-hosted weights

Hugging Face ecosystem context

Runtime and hardware considerations

Demo: deploy a model and use it in an agent

Resources

Speakers