Browse DevOps Community (228)

RohitMadhavKrishnan introduces ArchAngel, an educational AI coding assistant designed to bring a team’s engineering standards directly into the IDE, so junior developers get constructive feedback while they write code. The post outlines the core idea, a reference architecture, and the Microsoft-centric stack used to ground guidance in “golden repos.”
Dirk Brinkmann shows how to turn Azure Savings Plan recommendations into defensible, hour-by-hour data by exporting the underlying PAYG usage series and alternative commitment levels from the Azure Cost Management Benefit Recommendations API, using a companion PowerShell script that outputs CSV, Markdown, and JSON files.
viviandiec announces general availability of OpenTelemetry (OTel) Guest OS metrics for Azure VMs and Arc-enabled Servers, plus an updated Azure Monitor VM experience. The post explains what metrics are available, how OTel compares to Log Analytics-based metrics, and how to use PromQL and Grafana dashboards for troubleshooting at scale.
Sokuma announces the general availability of Service Level Indicators (SLIs) and Service Level Objectives (SLOs) in Azure Monitor, outlining how teams can track customer-experience reliability with SLI authoring, SLO tracking, error budgets, and burn rate–based alerting in a single Azure Monitor workflow.
Sokuma announces the general availability of Azure Monitor Metrics Export using data collection rules (DCRs), highlighting how to continuously stream platform metrics to Azure Storage, Event Hubs, or Log Analytics with multidimensional metrics support, metric-name filtering, and typical end-to-end latency of about three minutes.
TulikaC introduces new Azure CLI commands for listing and viewing Azure App Service for Linux startup logs, making it easier to diagnose container initialization issues, runtime startup failures, warmup probe problems, and slot-specific startup behavior directly from the command line.
BhaktiRath95 breaks down why Azure Container Apps can feel “slow to start” in production, separating true cold starts from scaling delays and resource throttling. The post includes concrete fixes like minReplicas tuning, KEDA rule adjustments, probe configuration, image-size reduction, and practical .NET and Django startup optimizations backed by Log Analytics and Application Insights queries.
j_folberth explains how to deploy Azure AI Foundry Hosted Agents directly from a source-code ZIP instead of a container image, including the deployment lifecycle, an azd-based workflow, and a reusable GitHub Action that posts to the Foundry data plane and polls until the new agent version becomes active.
Mahesh Sundaram announces a public preview in Azure Monitor that lets platform teams collect Azure resource platform logs at scale using Data Collection Rules (DCRs), replacing per-resource diagnostic settings with a centralized, policy-driven model that supports governance, cost control, and modern identity-based access.
Heather Poulsen shares an event session overview on designing Azure AI Landing Zones as a production-ready foundation for deploying AI applications and AI agents at scale, with guardrails for networking, identity, security, governance, and cost control using Microsoft’s recommended architecture frameworks.
brauerblogs announces a two-day “Path to Production for Agents” webinar series (July 27–28) focused on moving agentic AI from prototypes to production, covering governance, landing-zone architecture, AgentOps practices, security risks like prompt injection, and cost/performance optimization with Azure Monitor and Microsoft Foundry.
BhaktiRath95 walks through common startup and deployment failures in Azure Container Apps and Container App Jobs for .NET and Django workloads, showing what the errors look like in logs, why they happen, and the concrete CLI, configuration, and code changes that fix them.
Mayunk Jain summarizes the Azure App Service announcements from Microsoft Build 2026, including a new “Easy AI experience” with built-in MCP, GA of Isolated v4 for App Service Environments, and Managed Instance improvements for modernizing legacy apps (including IIS) with better diagnostics and deployment workflows.
sunayanasingh explains how Azure Monitor now supports exemplars so teams can jump from Prometheus/OpenTelemetry metric spikes to the exact OpenTelemetry trace in Application Insights, using Azure Managed Grafana for visualization and trace linking.
Johnson Shi provides an operational guide to running a geo-replicated Azure Container Registry (ACR) for high availability, explaining how global endpoints, regional endpoints, and dedicated data endpoints behave during incidents, throttling, and DNS changes, with concrete Azure CLI steps for setup, routing control, and troubleshooting.
Sally Dabbah explains how to turn Synapse/ADF/Microsoft Fabric pipeline failures into structured, queryable telemetry by sending standardized failure events into Azure Monitor Log Analytics via the Logs Ingestion API and a Data Collection Rule, enabling KQL-based analysis, alerting, and reliability reporting across environments and datasets.
anandranjan explains a practical AKS pattern for keeping secret values out of YAML and CI/CD by using Azure Key Vault with the Secrets Store CSI Driver and AKS Workload Identity. It covers the identity flow, required AKS/Azure setup, workload onboarding YAML, and common troubleshooting points around federation, labels, mounts, and permissions.
Alex-wdy explains how Azure CLI 2.86.0+ speeds up slow enterprise-scale az login by skipping post-auth subscription enumeration across many tenants and subscriptions. The post introduces --skip-subscription-discovery (and --skip-sub), targeted --subscription on login, and when to use (or avoid) these flags.
Johnson Shi, Zoey (Zhuyu) Li, and Huangli Wu announce public preview support for regional endpoints in Azure Container Registry geo-replication, including the new Azure CLI and portal experience, endpoint URL formats, and practical guidance for pinning pushes/pulls and Kubernetes workloads to specific replicas.
shijain13 explains what’s new in the Azure Monitor Health Model (Preview), focusing on expanded discovery options, faster health signal setup, and new aggregation rules that help teams reason about workload health with less alert noise and clearer troubleshooting paths.
leoyao summarizes the //build 2026 updates to Foundry Toolkit for VS Code, focusing on an end-to-end Hosted Agent workflow (scaffold, run, deploy, observe), richer Toolbox integrations, and new LangGraph samples that cover MCP, human-in-the-loop flows, and production observability.
Ram Kakani explains how Oracle Managed Database MCP (Model Context Protocol) remote servers can be used from Microsoft Foundry to build enterprise AI agents that query Oracle AI Database@Azure, including local VS Code workflows, self-hosted Azure deployments, and a fully managed OCI option with identity, networking, and governance controls.
LZhang lays out a practical DevOps loop for Microsoft Foundry Hosted Agents, covering how to move from Terraform-provisioned infrastructure to production delivery with immutable agent versions, evaluation as a release gate, manifest-driven promotion, traffic-split canaries, and per-version observability.
Ivan Varnitski announces a public preview feature for Azure Monitor Data Collection Rules that lets you run multi-stage transformations (processors) to filter, aggregate, parse, and reshape logs before they’re ingested into Log Analytics, cutting ingestion volume and cost while improving query-ready data quality.
susaraswat4 shares performance and sizing guidance for Azure Monitor pipeline, including measured Syslog/CEF ingestion throughput into Log Analytics, memory footprint, and how throughput scales with vCPUs and replicas. It also highlights operational behaviors like automatic core usage and TCP backpressure as a signal to scale.
kinfey breaks down a cost- and security-aware blueprint for running a multi-agent SDLC “tower” on AKS, using AI Runway for in-cluster model serving, Kata MicroVM isolation for each agent pod, and MCP so GitHub Copilot Chat can orchestrate tools while keeping token spend predictable.
j_folberth shows how to deploy a new version of an Azure AI Foundry Hosted Agent using a repeatable GitHub composite action, including required workflow prerequisites, action inputs/outputs, and a Bash + Azure CLI + jq approach for calling the Foundry data-plane REST endpoint safely.
MattMc announces new Azure Monitor capabilities for observing AI agents, including faster telemetry ingestion, larger event payloads for prompts/responses, an Agents fleet view, deeper end-to-end transaction debugging, and evaluation workflows (including human-in-the-loop annotations) across different hosting environments and frameworks.
j_folberth walks through what it takes to deploy an Azure AI Foundry Hosted Agent using the Foundry Service REST API, including the required Azure resources, container build/push flow, and the RBAC and managed identity setup needed for the Foundry project to pull images and create agent versions.
Akash Singhal and co-authors explain how Azure Container Registry (ACR) Artifact Cache implements pull-through caching at ACR scale, including the control-plane resources customers configure and the data-plane request path that serves pulls immediately while hydrating content asynchronously for later local serving.
Nathan Mangum introduces new preview capabilities in Azure Monitor Monitoring Coverage that help teams validate whether monitoring is correctly enabled and whether telemetry is actually flowing for VMs and AKS, and close alerting gaps by enabling Microsoft-recommended alerts at scale.
Anthony Chu announces public preview support for Go as a first-class language in Azure Functions on the Flex Consumption plan, including a new code-first Go SDK, supported triggers (HTTP, Timer, Service Bus, Event Hubs, Event Grid, Cosmos DB, Blob), local tooling, and deployment options.

What’s new in Observability at Build 2026

Priyanka Nanda summarizes the Build 2026 updates for Azure Monitor, including new agent observability features, the Azure Copilot Observability agent, expanded OpenTelemetry/OTLP ingestion, and improvements to alerts, metrics querying, and SLI/SLO tracking across services like AKS and Application Insights.

Azure Functions at Build 2026 Update

nzthiago summarizes the Build 2026 wave of Azure Functions updates, covering a new serverless agents runtime, first-class managed connectors, MCP improvements, refreshed local tooling (Functions CLI v5 and VS Code templates), Go support on Flex Consumption, Durable Task Scheduler enhancements, and new operational/security features like built-in Grafana dashboards and TLS certificates.
beenamore summarizes the Microsoft Build 2026 updates for Azure API Management and Azure API Center, focusing on governing AI-era API ecosystems: registering and assessing agents, MCP-based discovery, agent-to-agent APIs, a unified model API for multi-model apps, expanded AI gateway controls, and deeper token observability.
lily-ma introduces Hosted MCP Servers in Azure Logic Apps Connector Namespace (public preview), a managed way to deploy remote MCP endpoints from a catalog so AI agents can discover and call tools without you owning the underlying infrastructure, scaling, authentication, or monitoring setup.
coryskimming summarizes the Azure Kubernetes Service (AKS) announcements from Microsoft Build 2026, focusing on running AI training and inference at scale. It covers new options for cluster operations, bare-metal performance, fleet management across Arc-enabled clusters, and Kubernetes-native model serving with tools like KAITO and AI Runway.
Brendan Burns announces the public preview of Anyscale on Azure, a managed Ray platform that runs on Azure Kubernetes Service (AKS). The post focuses on scaling distributed AI training and inference across regions, simplifying operations via Azure-native provisioning and billing, and using Microsoft Entra workload identity for governance.
Vincent Liu announces general availability of a new Azure Files management experience for premium SSD NFS file shares, enabling each share to be created, secured, scaled, and billed independently. The post highlights higher scale limits, IaC support with Bicep/ARM, share-level networking and security boundaries, and snapshot-based data protection.
Mike Hulme introduces Microsoft’s “agentic modernization” approach, combining Azure Copilot migration agent and the GitHub Copilot modernization agent to help teams plan and execute large-scale application modernization, from estate discovery and dependency mapping through code transformation, PR-based execution, and governed rollout on Azure.
