Weekly AI Roundup: Local Inference, Agents, and Multi-Model Tools

AI technologies on Microsoft platforms continued to grow in hardware compatibility, agent reliability, model choice, and practical deployment, following the themes established in recent weeks. Guides and releases remain focused on bringing updated AI solutions into daily workflows, supporting best practices across both cloud and edge environments.

Azure AI Foundry and Studio: Unified Generative AI Platform

Azure AI Studio (now Azure AI Foundry) establishes itself as a central workspace for developing generative AI and deploying LLM solutions, spanning model options including OpenAI, Meta, Mistral, and more. The platform supports prompt engineering, fine-tuning, retrieval-augmented generation (RAG), and offers both code-first and low-code interfaces. GPT-4o adds voice and multimodal features, and Phi-3 offers options for lightweight inference. Security and governance improvements allow organizations to adopt responsible AI usage with integrated monitoring and compliance. Developers should remain aware of billing and vendor lock-in as they work with the platform. Foundry Local v0.7 brings support for Intel/AMD NPUs on Windows 11 and simplifies local inference and AI runtime management. Installation with winget (Windows) and brew (Mac) reduces setup friction for multi-platform development. Windows ML is now generally available, providing ONNX-based local inference for privacy and edge execution in Windows applications. Integrated with AMD Ryzen AI, Intel OpenVINO, NVIDIA TensorRT, and Snapdragon NPUs, Windows ML works closely with the App SDK and includes streamlined model conversion via the AI Toolkit for VS Code—highlighting edge AI’s readiness for production scenarios.

Secure and Reliable AI Agent Development with Azure and MCP

AI agent development is improving with integration methods for durable, reliable operations—building on previous agent orchestration and security content. Developers now have a step-by-step guide for using the OpenAI Agent SDK with Azure Durable Functions to support persistent state, retry logic, and distributed workflows, using decorators and orchestration functions to manage errors efficiently and reduce manual coding. The final Agent Factory installment explains how to build a secure, standards-based agent ecosystem using the agentic web stack—covering identity, trust, and compliance via Entra ID alongside open protocols. Practical tips on integrating standards and secure orchestration are included, addressing both Microsoft and open-source tools.

Model Context Protocol (MCP) and Registry: Best Practices and Interoperability

Azure's Model Context Protocol further embeds governance and security in AI workflows. Technical analysis highlights how MCP best practices in GitHub Copilot and VS Code enable automatic compliance and security enforcement, particularly for infrastructure-as-code scripts. Dynamic prompt instructions help teams maintain up-to-date policy compliance. A video walkthrough introduces the GitHub MCP Registry, allowing developers to locate and connect MCP servers for agent development and modular design. Additional guidance outlines secure MCP server integration for Logic Apps and Copilot Studio, including authentication and deployment recommendations.

Microsoft Copilot Studio and Model Selection

Copilot Studio adds model selection for Anthropic's Claude Sonnet 4 and Opus 4.1, alongside OpenAI's GPT models, enabling prompt- and logic-level model configuration. Admin options in Microsoft 365 and Power Platform allow for domain-specific assignments and automated fallback rules—providing more control for organizations pursuing consistent automation.

Microsoft Fabric: LLM Analytics, Real-Time AI, and Workflow Automation

Fabric Data Agent now supports mirrored cloud databases, allowing natural language queries and multimodal analytics using Delta Parquet mirrors. Previewed anomaly detection in RTI streamlines streaming analytics with integration into Teams and email alerts. Agent Mart Studio’s expanded connections with Fabric and OneLake enhance low- and no-code workflow automation for data professionals and developers.

.NET and Multimodal AI: Text-to-Image Capabilities

MEAI adds text-to-image generation in .NET, providing a consistent API that abstracts providers like Azure AI Foundry, OpenAI, and ONNX. This update prepares for future image-to-image and image-to-video support, making multimodal AI more accessible for .NET applications.

SharePoint and Microsoft 365: AI-Driven Content Intelligence

SharePoint's Knowledge Agent (public preview) delivers AI-powered automation for content metadata, summaries, document comparison, and rule creation, with workflow integration into Copilot. Controlled pilot programs, governance, review cycles, and training are emphasized for effective enterprise use.

Building and Operationalizing AI-Powered Agents

Developers continue to build practical agents, with a tutorial on creating a resilience coach using Azure OpenAI and Python. Additional resources show agent memory management with Semantic Kernel and Azure AI Search, alongside customization guides for LLMs and Cognitive Services. An operational workflow demonstrates post-call analytics using Azure OpenAI to process transcripts and feed CRM systems.

AI for Social Impact and Enterprise Architecture

UNHCR, Microsoft, and GitHub share new uses of drone data and open-source AI for sustainable planning in refugee settlements, showcasing adaptive open tools. Updated architecture frameworks now account for AI requirements, MLOps, and explainability. Sustainability remains a priority, with AI solutions for digital twins, forecasting, and energy-use reduction continuing the focus on practical environmental reliability.

Other AI News

Research teams at Microsoft, Drexel University, and the Broad Institute present generative AI for rare disease diagnosis, utilizing Azure AI Foundry for evidence aggregation and collaborative genome review—a continuation of last week’s healthcare AI initiatives.