Microsoft Developer hosts Pablo Salvador Lopez and Priyanka Vergadia in this episode, providing hands-on insights into building scalable omnichannel Voice AI agents with Azure’s Speech Services and multi-agent orchestration.

Building Omnichannel Voice AI Agents with Azure

Overview

This session from Sip and Sync with Azure, featuring Pablo Salvador Lopez and Priyanka Vergadia, demonstrates how organizations—like those in insurance—can create real-time, multilingual customer service bots using Azure Voice AI.

Key Topics

Demo Walkthrough: Real-world insurance claim scenario
Multilingual Support: Seamless switching between languages (English, Spanish)
Intent-Based Workflow: Automatic agent handoffs for specific queries, e.g., claims or policy details

Architecture Breakdown

AI Layer

Multi-agent orchestration enabling specialized agents for tasks
Intent recognition and AI-driven conversation management

Speech Layer

Integration of Azure Speech Services for speech-to-text and text-to-speech
Language detection and real-time voice interactions

Application Layer

Use of WebSocket and WebRTC for real-time voice pipelines

Telephony Integration

Azure Communication Services connects voice AI to telephony systems

Tools and APIs

Voice Live API accelerates voice application development
Multi-Agent Accelerator solution provides reusable architecture patterns

Best Practices

Architect for scalability and seamless agent handoffs
Optimize multilingual pipelines for diverse user bases
Leverage real-time processing through Azure’s Speech and Communication Services

Getting Started

Speakers

Pablo Salvador Lopez - Principal Solution Engineer (LinkedIn)
Priyanka Vergadia – Principal Cloud Advocate (LinkedIn, Twitter)