Microsoft Build 2026: MAI models in Microsoft Foundry across text, image, voice, and speech

Name: Microsoft Build 2026: MAI models in Microsoft Foundry across text, image, voice, and speech
Uploaded: 2026-06-03T18:50:22+00:00
Description: Learn Microsoft AI covers Microsoft Build 2026 announcements for MAI models in Azure AI Foundry, spanning text, image, voice, and speech modalities, and...

Jun 3, 2026 by Learn Microsoft AI

Learn Microsoft AI covers Microsoft Build 2026 announcements for MAI models in Azure AI Foundry, spanning text, image, voice, and speech modalities.

Overview

At Microsoft Build 2026, Microsoft announced updates to its first-party model lineup in Microsoft Foundry, expanding coverage across four modalities:

Text
Image
Voice
Speech

The video highlights the following models and their intended use cases for developers building AI applications:

MAI-Thinking-1: positioned for advanced reasoning and coding scenarios.
MAI-Image-2.5: positioned for image generation and image editing.
MAI-Voice-2: positioned for multilingual voice cloning and text-to-speech.
MAI-Transcribe-1.5: positioned for speech-to-text, with support for 43 languages.

Microsoft Build 2026: MAI models in Microsoft Foundry across text, image, voice, and speech

Overview

Links