Why Microsoft Built Its Own Models
For years, Microsoft's AI products — Copilot, Azure AI services, GitHub Copilot — relied primarily on OpenAI models. At Build 2026, Microsoft AI CEO Mustafa Suleyman made the strategic shift explicit with the launch of the MAI model family — seven new models spanning image, voice, transcription, and reasoning.
The overarching vision Suleyman called "Humanist Superintelligence": AI explicitly designed to serve people and organizations, not replace them. And the central enterprise pitch for MAI-Thinking-1 is Zero Distillation: no output data from OpenAI, Anthropic, Google, or any other lab was used in training. Every capability was built from scratch using Microsoft's own datasets and RL pipeline — giving enterprise customers a clean, traceable, commercially licensed data lineage.
Architecture and Specifications
MAI-Thinking-1 is built on a sparse Mixture-of-Experts (MoE) architecture:
- Active parameters: 35 billion
- Total parameters: ~1 trillion (sparse MoE)
- Context window: 256K tokens
- Inference footprint: Competitive with models several times larger in the medium-size weight class
The sparse MoE design means inference cost is far lower than its total parameter count suggests. Microsoft's internal tests show that fine-tuning MAI models on McKinsey's task-specific data delivered the highest win rate — including outperforming GPT-5.5 — at 10x the cost efficiency.
Seven models announced: MAI Image 2.5 & Flash (text-to-image, #2 globally on image editing), MAI Transcribe 1.5 (speech-to-text, 43 languages, 5× faster than rivals), MAI Voice 2 & Flash (speech synthesis, 15 languages), MAI Thinking 1 (reasoning), and MAI Code 1 Flash (coding, only 5B params, 51% SWE-Bench Pro). All are rolling out via Microsoft Foundry.
Benchmark Comparison
| Model | SWE-Bench Pro | AIME 2025 | Active Params |
|---|---|---|---|
| MAI-Thinking-1 | 53.4% | 97.0% | 35B |
| Claude Opus 4.6 | ~53% | N/A | Undisclosed |
| MAI Code 1 Flash | 51% | — | 5B |
| Claude Sonnet 4.6 | Lower | N/A | Undisclosed |
MAI-Thinking-1 reaches frontier-level coding and math performance in the medium weight class, with inference costs that make it viable for everyday production use — not just occasional heavyweight tasks.
All MAI models ship with: ▲ Voice watermarking against unauthorized cloning ▲ Copyright protection ▲ An RL training loop that treats over-refusals and unsafe compliance as equally unacceptable defects — rather than trading one for the other. A detailed technical report was published alongside the launch.
Silicon Co-Design: Optimized for Maia 200
Microsoft co-designed MAI-Thinking-1 with its proprietary Maia 200 AI chip. Benchmarked head-to-head against NVIDIA's GB-200, the Maia 200 delivers a 1.4× performance-per-watt gain when running MAI models end-to-end — on top of the 30% performance improvement that CEO Satya Nadella separately cited for the platform. At cloud scale, every watt counts, and silicon-model co-design is a compounding advantage.
MAI models are also coming to Microsoft's next-gen AI PC platform N1X later this year, targeting on-device inference for Windows.
Frontier Tuning: Your Own Moat
Beyond the base model, Microsoft introduced Microsoft Frontier Tuning — the ability for enterprises to create custom agents using reinforcement learning environments (RLEs) tailored to their own workflows and data. The resulting fine-tuned model stays entirely within the customer's control, and only the customer benefits from the hard-won institutional knowledge encoded in their agents.
Unlike shared foundation model APIs, Microsoft's pitch is: "Only you keep the benefits of your own workflows, know-how, and data."
- MAI-Thinking-1 is Microsoft's first in-house reasoning model — matching Claude Opus 4.6 on SWE-Bench Pro (53.4%).
- Zero distillation from other AI labs: clean, commercially licensed data lineage for enterprise use.
- 35B active parameters (MoE) — far lower inference cost than its total 1T-parameter count implies.
- 97% on AIME 2025 — near-frontier math and reasoning for its weight class.
- Currently in private preview on Microsoft Foundry; MAI Playground public preview coming soon.