Private Preview: Azure Managed Prometheus for VM & VMSS Monitoring
Daramfon presents an introduction to Azure Managed Prometheus support for VMs and VMSS, now in private preview, highlighting unified monitoring, integration with Azure Monitor, and specific benefits for HPC and AI workloads.
Azure Managed Prometheus now supports monitoring for traditional Virtual Machines (VMs) and Virtual Machine Scale Sets (VMSS), expanding beyond its original focus on containerized workloads such as AKS and Azure Arc–enabled clusters. The private preview adds the following capabilities:
Key Capabilities
- Unified Metric Collection: Integrate Prometheus-style monitoring across both containers and IaaS workloads (VMs/VMSS).
- HPC & AI Monitoring: Full support for GPU and InfiniBand metric collection, covering the hardware that high-performance computing (HPC) and AI workloads depend on.
- Comprehensive Telemetry: Collect node-level (CPU, memory, storage, NIC, InfiniBand) and GPU-level (utilization, memory, throttling, ECC) metrics through standard Prometheus exporters.
- Managed Backend: Use a fully managed Prometheus backend that scales automatically, with no servers to maintain and no metric storage to operate.
- Azure Monitor Integration: Store, query, and visualize metrics directly within Azure Monitor Workspaces.
- Visualization & Alerting: Use Azure Managed Grafana for pre-built dashboards, plus native support for PromQL and alerting rules.
- Mixed Fleet Observability: Monitor containerized (AKS), Arc-enabled, and IaaS resources in a single, unified environment.
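To make the "standard Prometheus exporters" point concrete, here is a minimal sketch of how exporter output in the Prometheus text exposition format can be read. The announcement does not name specific exporters; the metric names below assume node_exporter (`node_cpu_seconds_total`) and the NVIDIA DCGM exporter (`DCGM_FI_DEV_GPU_UTIL`), and the label values and numbers are made up for illustration. The helper `parse_samples` is hypothetical and is not part of any Azure tooling.

```python
import re

# Matches `name{labels} value`; the `{labels}` section is optional.
SAMPLE_RE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)')
# Matches one `key="value"` pair (escaped characters are kept as-is).
LABEL_RE = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="((?:[^"\\]|\\.)*)"')


def parse_samples(text):
    """Return (name, labels, value) tuples from exposition-format text."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and `# HELP` / `# TYPE` comment lines.
        if not line or line.startswith("#"):
            continue
        match = SAMPLE_RE.match(line)
        if match is None:
            continue
        name, raw_labels, raw_value = match.groups()
        labels = dict(LABEL_RE.findall(raw_labels or ""))
        samples.append((name, labels, float(raw_value)))
    return samples


# Illustrative exporter output (assumed metric names, invented values).
EXAMPLE = """\
# HELP node_cpu_seconds_total Seconds the CPUs spent in each mode.
# TYPE node_cpu_seconds_total counter
node_cpu_seconds_total{cpu="0",mode="idle"} 12345.6
# TYPE DCGM_FI_DEV_GPU_UTIL gauge
DCGM_FI_DEV_GPU_UTIL{gpu="0",Hostname="vm-hpc-01"} 87
"""

for name, labels, value in parse_samples(EXAMPLE):
    print(name, labels, value)
```

In the preview itself, scraping and ingestion are handled by the managed service; the sketch only shows what the scraped node-level and GPU-level samples look like.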
Why It Matters for HPC Workloads
- AI/HPC Optimization: Enables detailed cluster performance visibility for teams managing GPU-accelerated infrastructure.
- Node & Cluster Views: Built-in dashboards and deep linking for fast navigation between cluster-level and node-level insights.
- Real-Time Querying: Execute live PromQL queries against Azure Monitor data sources.
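As a rough illustration of live PromQL querying, the sketch below builds an instant-query URL in the shape of the standard Prometheus HTTP API (`/api/v1/query`). It assumes the workspace exposes a Prometheus-compatible query endpoint; the endpoint value is a placeholder, the metric name assumes the NVIDIA DCGM exporter, and `build_instant_query_url` is a hypothetical helper. Authentication is omitted entirely.

```python
from urllib.parse import urlencode


def build_instant_query_url(query_endpoint, promql):
    """Build a Prometheus HTTP API instant-query URL for a PromQL expression."""
    base = query_endpoint.rstrip("/")
    return f"{base}/api/v1/query?{urlencode({'query': promql})}"


# Placeholder endpoint; substitute the query endpoint of your own workspace.
QUERY_ENDPOINT = "https://example-workspace.example.invalid"

# Average GPU utilization per instance (assumes DCGM exporter metrics).
PROMQL = "avg by (instance) (DCGM_FI_DEV_GPU_UTIL)"

print(build_instant_query_url(QUERY_ENDPOINT, PROMQL))
```

A real request would also need to carry a valid access token, which is outside the scope of this sketch.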
How to Participate
- Access: The private preview currently requires your Azure subscription to be allowlisted. Request access here.
- Getting Started: Use the step-by-step onboarding guide on GitHub for setup instructions.
- Feedback: Once onboarded, provide feedback and ask questions by opening issues in the provided GitHub repository.
Next Steps
- Begin scraping metrics, visualize them in Azure Managed Grafana, and monitor both AKS and VM-based fleets.
- Leverage out-of-the-box dashboards and custom PromQL queries for observability tailored to HPC and AI environments.
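One way custom PromQL results might be put to use in an HPC setting is to flag idle GPUs. The sketch below reads a response in the standard Prometheus instant-query JSON shape and lists instances whose value falls below a threshold. The response is hand-written for illustration, not real output; the `instance` label, the threshold, and the helper name `underutilized` are all assumptions.

```python
def underutilized(response, threshold):
    """Map instance -> value for series whose value is below `threshold`."""
    if response.get("status") != "success":
        raise ValueError("query did not succeed")
    data = response["data"]
    if data["resultType"] != "vector":
        raise ValueError("expected an instant vector result")
    low = {}
    for series in data["result"]:
        # Each sample is [unix_timestamp, "value-as-string"].
        _timestamp, raw_value = series["value"]
        value = float(raw_value)
        if value < threshold:
            instance = series["metric"].get("instance", "unknown")
            low[instance] = value
    return low


# Hand-written response for illustration only (not real measurements).
SAMPLE_RESPONSE = {
    "status": "success",
    "data": {
        "resultType": "vector",
        "result": [
            {"metric": {"instance": "vm-hpc-01"}, "value": [1700000000, "87"]},
            {"metric": {"instance": "vm-hpc-02"}, "value": [1700000000, "5"]},
        ],
    },
}

print(underutilized(SAMPLE_RESPONSE, threshold=20.0))
```

The same condition could equally be expressed directly in PromQL and attached to an alerting rule; the Python form is only meant to show the shape of the data.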
This announcement was posted by Daramfon. As this is a private preview, user feedback is especially encouraged to shape the future direction of Azure’s managed monitoring platform.
This post appeared first on “Microsoft Tech Community”. Read the entire article here