Browse All Community (431)

RohitMadhavKrishnan introduces ArchAngel, an educational AI coding assistant designed to bring a team’s engineering standards directly into the IDE, so junior developers get constructive feedback while they write code. The post outlines the core idea, a reference architecture, and the Microsoft-centric stack used to ground guidance in “golden repos.”
BhaktiRath95 walks through common failure modes when running AI/ML inference workloads on Azure Container Apps, including slow model startup, probe timeouts, OOM kills, and GPU initialization problems. The post provides concrete probe settings, Python/FastAPI patterns, and Log Analytics queries to diagnose and fix issues methodically.
Dirk Brinkmann shows how to turn Azure Savings Plan recommendations into defensible, hour-by-hour data by exporting the underlying PAYG usage series and alternative commitment levels from the Azure Cost Management Benefit Recommendations API, using a companion PowerShell script that outputs CSV, Markdown, and JSON files.
viviandiec announces general availability of OpenTelemetry (OTel) Guest OS metrics for Azure VMs and Arc-enabled Servers, plus an updated Azure Monitor VM experience. The post explains what metrics are available, how OTel compares to Log Analytics-based metrics, and how to use PromQL and Grafana dashboards for troubleshooting at scale.
Sokuma announces the general availability of Service Level Indicators (SLIs) and Service Level Objectives (SLOs) in Azure Monitor, outlining how teams can track customer-experience reliability with SLI authoring, SLO tracking, error budgets, and burn rate–based alerting in a single Azure Monitor workflow.
Sokuma announces the general availability of Azure Monitor Metrics Export using data collection rules (DCRs), highlighting how to continuously stream platform metrics to Azure Storage, Event Hubs, or Log Analytics with multidimensional metrics support, metric-name filtering, and typical end-to-end latency of about three minutes.
Sunita_AZ0708 explains how to run Ansys Discovery on Azure using NVads V710 v5 GPU VMs, including a reference architecture, right-sizing guidance for fractional GPUs, and validation results across fluid, thermal, and structural simulation scenarios.
Rafia Aqil explains how to diagnose and respond when Azure Databricks clusters can’t start or scale due to Azure regional VM capacity constraints, including what to send to Microsoft support, which VM families to switch to, and longer-term design choices like instance pools, serverless compute, and multi-region deployments.
ShubhamSachdeva99 explains how to switch built-in connector connections at runtime in Azure Logic Apps Standard by making the service provider action’s connectionName dynamic, enabling a single workflow to route to different SFTP/SQL/Service Bus endpoints per team or environment.
TulikaC introduces new Azure CLI commands for listing and viewing Azure App Service for Linux startup logs, making it easier to diagnose container initialization issues, runtime startup failures, warmup probe problems, and slot-specific startup behavior directly from the command line.
BhaktiRath95 breaks down why Azure Container Apps can feel “slow to start” in production, separating true cold starts from scaling delays and resource throttling. The post includes concrete fixes such as minReplicas tuning, KEDA rule adjustments, probe configuration, and image-size reduction, along with practical .NET and Django startup optimizations backed by Log Analytics and Application Insights queries.
j_folberth explains how to deploy Azure AI Foundry Hosted Agents directly from a source-code ZIP instead of a container image, including the deployment lifecycle, an azd-based workflow, and a reusable GitHub Action that posts to the Foundry data plane and polls until the new agent version becomes active.
Mahesh Sundaram announces a public preview in Azure Monitor that lets platform teams collect Azure resource platform logs at scale using Data Collection Rules (DCRs), replacing per-resource diagnostic settings with a centralized, policy-driven model that supports governance, cost control, and modern identity-based access.
Heather Poulsen shares an optimization playbook for running agentic AI workloads in production on Azure, focusing on keeping multi-agent orchestration reliable while controlling token costs and latency. It highlights practical techniques like inference routing, prompt compression, RAG tuning, caching, and FinOps-style capacity planning.
Heather Poulsen outlines a governance-first blueprint for building scalable agentic AI systems, focusing on how to embed consistent controls and quality checks across user interactions, agent orchestration, integrations, data, and models so systems can scale without losing trust and oversight.
Heather Poulsen shares an event session overview on designing Azure AI Landing Zones as a production-ready foundation for deploying AI applications and AI agents at scale, with guardrails for networking, identity, security, governance, and cost control using Microsoft’s recommended architecture frameworks.
Rafia_Aqil outlines a reference architecture for ingesting both streaming and batch data through Microsoft Fabric into Azure Databricks, using OneLake/ADLS and a medallion (Bronze/Silver/Gold) layout. The post breaks down five Fabric-to-Databricks integration paths and calls out security, governance, and monitoring considerations.
brauerblogs announces a two-day “Path to Production for Agents” webinar series (July 27–28) focused on moving agentic AI from prototypes to production, covering governance, landing-zone architecture, AgentOps practices, security risks like prompt injection, and cost/performance optimization with Azure Monitor and Microsoft Foundry.
BhaktiRath95 walks through common startup and deployment failures in Azure Container Apps and Container App Jobs for .NET and Django workloads, showing what the errors look like in logs, why they happen, and the concrete CLI, configuration, and code changes that fix them.
Mayunk Jain summarizes the Azure App Service announcements from Microsoft Build 2026, including a new “Easy AI experience” with built-in MCP, GA of Isolated v4 for App Service Environments, and Managed Instance improvements for modernizing legacy apps (including IIS) with better diagnostics and deployment workflows.
sunayanasingh explains how Azure Monitor now supports exemplars so teams can jump from Prometheus/OpenTelemetry metric spikes to the exact OpenTelemetry trace in Application Insights, using Azure Managed Grafana for visualization and trace linking.
Johnson Shi provides an operational guide to running a geo-replicated Azure Container Registry (ACR) for high availability, explaining how global endpoints, regional endpoints, and dedicated data endpoints behave during incidents, throttling, and DNS changes, with concrete Azure CLI steps for setup, routing control, and troubleshooting.
shashankamalladi announces general availability of Network Security Perimeter (NSP) support for Azure Service Bus, including availability in Azure Government regions. The post explains how NSP provides a centralized security boundary with default-deny communication, explicit inbound/outbound rules, and diagnostic logging for audit and compliance.
jordanselig announces a public preview feature that lets Azure App Service expose an existing REST API as a Model Context Protocol (MCP) server using only an OpenAPI spec. The post covers how the platform generates MCP tools, how to configure it, and what to consider for authentication and safe exposure.
Sally Dabbah explains how to turn Synapse/ADF/Microsoft Fabric pipeline failures into structured, queryable telemetry by sending standardized failure events into Azure Monitor Log Analytics via the Logs Ingestion API and a Data Collection Rule, enabling KQL-based analysis, alerting, and reliability reporting across environments and datasets.
anandranjan explains a practical AKS pattern for keeping secret values out of YAML and CI/CD by using Azure Key Vault with the Secrets Store CSI Driver and AKS Workload Identity. It covers the identity flow, required AKS/Azure setup, workload onboarding YAML, and common troubleshooting points around federation, labels, mounts, and permissions.
Alex-wdy explains how Azure CLI 2.86.0+ speeds up slow enterprise-scale az login by skipping post-auth subscription enumeration across many tenants and subscriptions. The post introduces --skip-subscription-discovery (and --skip-sub), targeted --subscription on login, and when to use (or avoid) these flags.
amolravande explains how to run agent-generated Python safely by combining Agent Governance Toolkit (AGT) policy enforcement with Azure Container Apps Sandboxes, using per-session microVM isolation plus a fail-closed egress proxy to reduce the blast radius of untrusted code.
Johnson Shi, Zoey (Zhuyu) Li, and Huangli Wu announce public preview support for regional endpoints in Azure Container Registry geo-replication, including the new Azure CLI and portal experience, endpoint URL formats, and practical guidance for pinning pushes/pulls and Kubernetes workloads to specific replicas.
shijain13 explains what’s new in the Azure Monitor Health Model (Preview), focusing on expanded discovery options, faster health signal setup, and new aggregation rules that help teams reason about workload health with less alert noise and clearer troubleshooting paths.
Sam Foo explains how Pod CIDR expansion works for Azure CNI Overlay in Azure Kubernetes Service (AKS), and what to consider when planning pod IP ranges for long-lived clusters as they scale.
kinfey explains how to run LLM agents that write and execute code without giving them a host-sized blast radius, using a MicroVM sandbox. The post walks through a real pipeline (a daily Mandarin World Cup podcast) built with Microsoft Agent Framework, Azure AI Foundry, and Hyperlight snapshot/restore isolation.
leoyao summarizes the //build 2026 updates to Foundry Toolkit for VS Code, focusing on an end-to-end Hosted Agent workflow (scaffold, run, deploy, observe), richer Toolbox integrations, and new LangGraph samples that cover MCP, human-in-the-loop flows, and production observability.
Ram Kakani explains how Oracle Managed Database MCP (Model Context Protocol) remote servers can be used from Microsoft Foundry to build enterprise AI agents that query Oracle AI Database@Azure, including local VS Code workflows, self-hosted Azure deployments, and a fully managed OCI option with identity, networking, and governance controls.
LZhang lays out a practical DevOps loop for Microsoft Foundry Hosted Agents, covering how to move from Terraform-provisioned infrastructure to production delivery with immutable agent versions, evaluation as a release gate, manifest-driven promotion, traffic-split canaries, and per-version observability.
mmcrey announces Confidential Live Migration for Intel TDX Confidential VMs in Azure, explaining how Azure can move a running confidential VM to updated infrastructure with limited interruption while protecting VM memory and execution context through attestation, policy checks, and encrypted state transfer.
Connected-Seth outlines new Azure Event Grid Namespace capabilities for IoT and event-driven systems, including MQTT v5 Subscription Identifiers (GA), larger 1 MB event payloads (GA, coming soon), and autoscale up/down (preview, coming soon), plus a GA integration for routing Stripe events into Azure services.
Ivan Varnitski announces a public preview feature for Azure Monitor Data Collection Rules that lets you run multi-stage transformations (processors) to filter, aggregate, parse, and reshape logs before they’re ingested into Log Analytics, cutting ingestion volume and cost while improving query-ready data quality.
susaraswat4 shares performance and sizing guidance for Azure Monitor pipeline, including measured Syslog/CEF ingestion throughput into Log Analytics, memory footprint, and how throughput scales with vCPUs and replicas. It also highlights operational behaviors like automatic core usage and TCP backpressure as a signal to scale.
madhurinrao introduces Azure Copilot Migration Agent, a guided workflow in Azure Migrate that connects discovery, assessment, planning, and execution for storage migrations—covering SMB/NFS file shares to Azure Files and Azure Blob container-to-container transfers.
