winml-cli¶
Windows ML CLI is a command line tool for building portable, performant, and high-quality AI models for Windows ML. It takes you from a source model — whether from Hugging Face or your own pipeline — to a hardware-optimized artifact in a reproducible workflow.
Purpose-built for Windows hardware diversity, the CLI handles conversion, graph optimization, and compilation across AMD, Intel, NVIDIA, and Qualcomm targets. The CLI fits naturally into CI/CD pipelines so teams can validate and ship models easily.
What you can do¶
- Build once, run across hardwares. Compose your own workflow from primitive commands (
export,analyze,optimize,quantize,compile), or use an auto-generated config withwinml build— both produce portable models that run across hardware. - Drill into the details. Deep insights into operator compatibility, shape mismatches, graph optimizations, and EP-aware tuning at any stage of the pipeline.
- AI-ready. CLI-driven tools with built-in skills, friendly to work with mainstream agents.
What you get out of the box¶
- All Windows ML EPs supported. Every supported execution provider is available behind the same commands.
- Curated model catalog. A verified set of models that run across all Windows ML EPs — a reliable starting point.
- Bring your own ONNX. Not only for converting from PyTorch — bring an existing ONNX model to get operator-compatibility insights and optimize it based on the analysis.
Where to start¶
- Installation — get the
winmlCLI running locally. - Quickstart — export a Hugging Face model in five minutes.
Learn the model¶
- How winml-cli Works — the pipeline from a PyTorch model to an EP-compiled artifact.
- Commands — reference for all 12
winmlsubcommands. - Samples — walkthroughs for BERT and CLIP.
Repository access¶
To request access to the Windows ML CLI repository, visit aka.ms/winml-cli.
License¶
MIT. See LICENSE.