Expand local AI reach with Windows ML | OD851

Andrew Leader and Maha Bayana explain how Windows ML enables local AI apps on Windows using custom or open-source ONNX models, with a focus on running inference efficiently across CPU, GPU, and NPU. They also cover what’s new, including WebNN support for web scenarios and improved tooling via AI Toolkit for VS Code.

Overview

Windows ML lets developers build local, AI-powered applications on Windows that run models efficiently across available hardware (GPU, NPU, and CPU) using a unified platform.

The session highlights:

Session outline (from chapters)

Introduction: cloud costs and benefits of local AI

Windows ML overview and supported ONNX models

Compatibility and language support

Demo scenario: sentiment analysis dashboard

Model conversion with Windows ML CLI

Web app integration

Web inference via WebNN

Performance optimization for NPUs

Moving to a native Windows app

Resources