Weekly AI Roundup: Local Inference, Agents, and Multi-Model Tools
AI technologies on Microsoft platforms continued to grow in hardware compatibility, agent reliability, model choice, and practical deployment, following the themes established in recent weeks. Guides and releases remain focused on bringing updated AI solutions into daily workflows, supporting best practices across both cloud and edge environments.
Azure AI Foundry and Studio: Unified Generative AI Platform
Azure AI Studio (now Azure AI Foundry) establishes itself as a central workspace for developing generative AI and deploying LLM solutions, spanning model options including OpenAI, Meta, Mistral, and more. The platform supports prompt engineering, fine-tuning, retrieval-augmented generation (RAG), and offers both code-first and low-code interfaces. GPT-4o adds voice and multimodal features, and Phi-3 offers options for lightweight inference. Security and governance improvements allow organizations to adopt responsible AI usage with integrated monitoring and compliance. Developers should remain aware of billing and vendor lock-in as they work with the platform. Foundry Local v0.7 brings support for Intel/AMD NPUs on Windows 11 and simplifies local inference and AI runtime management. Installation with winget (Windows) and brew (Mac) reduces setup friction for multi-platform development. Windows ML is now generally available, providing ONNX-based local inference for privacy and edge execution in Windows applications. Integrated with AMD Ryzen AI, Intel OpenVINO, NVIDIA TensorRT, and Snapdragon NPUs, Windows ML works closely with the App SDK and includes streamlined model conversion via the AI Toolkit for VS Code—highlighting edge AI’s readiness for production scenarios.
- Azure AI Studio and AI Foundry: Microsoft’s Generative AI Platform Explained
- Foundry Local Meets More Silicon: Expanded AI Runtime and NPU Support
- Windows ML Now Generally Available: Empowering Developers to Deploy Local AI on Windows Devices
Secure and Reliable AI Agent Development with Azure and MCP
AI agent development is improving with integration methods for durable, reliable operations—building on previous agent orchestration and security content. Developers now have a step-by-step guide for using the OpenAI Agent SDK with Azure Durable Functions to support persistent state, retry logic, and distributed workflows, using decorators and orchestration functions to manage errors efficiently and reduce manual coding. The final Agent Factory installment explains how to build a secure, standards-based agent ecosystem using the agentic web stack—covering identity, trust, and compliance via Entra ID alongside open protocols. Practical tips on integrating standards and secure orchestration are included, addressing both Microsoft and open-source tools.
- Enhancing AI Agent Reliability with OpenAI Agent SDK and Azure Durable Functions
- Agent Factory: Designing the Open Agentic Web Stack
Model Context Protocol (MCP) and Registry: Best Practices and Interoperability
Azure's Model Context Protocol further embeds governance and security in AI workflows. Technical analysis highlights how MCP best practices in GitHub Copilot and VS Code enable automatic compliance and security enforcement, particularly for infrastructure-as-code scripts. Dynamic prompt instructions help teams maintain up-to-date policy compliance. A video walkthrough introduces the GitHub MCP Registry, allowing developers to locate and connect MCP servers for agent development and modular design. Additional guidance outlines secure MCP server integration for Logic Apps and Copilot Studio, including authentication and deployment recommendations.
- Teaching the LLM Good Habits: How Azure MCP Uses Best-Practice Tools
- A Deep Dive into the GitHub MCP Registry for AI Agents
- Connecting Azure Logic Apps MCP Server to Copilot Studio Securely
Microsoft Copilot Studio and Model Selection
Copilot Studio adds model selection for Anthropic's Claude Sonnet 4 and Opus 4.1, alongside OpenAI's GPT models, enabling prompt- and logic-level model configuration. Admin options in Microsoft 365 and Power Platform allow for domain-specific assignments and automated fallback rules—providing more control for organizations pursuing consistent automation.
Microsoft Fabric: LLM Analytics, Real-Time AI, and Workflow Automation
Fabric Data Agent now supports mirrored cloud databases, allowing natural language queries and multimodal analytics using Delta Parquet mirrors. Previewed anomaly detection in RTI streamlines streaming analytics with integration into Teams and email alerts. Agent Mart Studio’s expanded connections with Fabric and OneLake enhance low- and no-code workflow automation for data professionals and developers.
- Unlocking LLM-Powered Analytics with Fabric Data Agent and Mirrored Databases
- AI–Powered Real-Time Intelligence with Anomaly Detection in Microsoft Fabric (Preview)
- Building AI Agents for Enterprise Data with Agent Mart Studio and Microsoft Fabric
.NET and Multimodal AI: Text-to-Image Capabilities
MEAI adds text-to-image generation in .NET, providing a consistent API that abstracts providers like Azure AI Foundry, OpenAI, and ONNX. This update prepares for future image-to-image and image-to-video support, making multimodal AI more accessible for .NET applications.
SharePoint and Microsoft 365: AI-Driven Content Intelligence
SharePoint's Knowledge Agent (public preview) delivers AI-powered automation for content metadata, summaries, document comparison, and rule creation, with workflow integration into Copilot. Controlled pilot programs, governance, review cycles, and training are emphasized for effective enterprise use.
Building and Operationalizing AI-Powered Agents
Developers continue to build practical agents, with a tutorial on creating a resilience coach using Azure OpenAI and Python. Additional resources show agent memory management with Semantic Kernel and Azure AI Search, alongside customization guides for LLMs and Cognitive Services. An operational workflow demonstrates post-call analytics using Azure OpenAI to process transcripts and feed CRM systems.
- Building a Resilience Coach with AI on Cozy AI Kitchen
- AI Agent Memory: Building Self-Improving Agents
- Generative AI in Azure: A Practical Guide to Getting Started
- From Call Transcripts to CRM Gold: AI-Powered Post-Call Intelligence
AI for Social Impact and Enterprise Architecture
UNHCR, Microsoft, and GitHub share new uses of drone data and open-source AI for sustainable planning in refugee settlements, showcasing adaptive open tools. Updated architecture frameworks now account for AI requirements, MLOps, and explainability. Sustainability remains a priority, with AI solutions for digital twins, forecasting, and energy-use reduction continuing the focus on practical environmental reliability.
- Using AI and Open Source to Map Refugee Settlements: The UNHCR and GitHub Story
- Software Architecture Frameworks and Artificial Intelligence: Building Smarter Systems
- Accelerating Sustainability and Resilience with AI-Powered Innovation
Other AI News
Research teams at Microsoft, Drexel University, and the Broad Institute present generative AI for rare disease diagnosis, utilizing Azure AI Foundry for evidence aggregation and collaborative genome review—a continuation of last week’s healthcare AI initiatives.