TL;DR: At Microsoft Build 2026, CEO Mustafa Suleiman announced seven new internally-developed MAI models: MAI-Thinking-1 (reasoning), MAI-Code-1-Flash (coding), MAI-Image-2.5 (image generation and editing), MAI-Transcribe-1.5 (speech-to-text), and MAI-Voice-2 (text-to-speech), plus Flash variants. Built from scratch without distilling from other labs, the family targets "Humanist Superintelligence."

Microsoft Goes All-In on Its Own Models

After years of backing OpenAI with billions in investment, Microsoft is now fielding its own frontier models. At Build 2026, Microsoft AI CEO Mustafa Suleiman unveiled seven MAI models built entirely in-house — no distillation from external labs, no opaque datasets. The models span reasoning, coding, image generation, voice synthesis, and transcription, positioning Microsoft as a serious contender in the foundation-model race.

📊 Key Numbers
  • MAI-Thinking-1 active parameters: 35B (MoE, ~1T total)
  • SWE-Bench Pro score: 53% (matches Claude Opus 4.6)
  • AIME 2025 score: 97.0%
  • MAI-Transcribe-1.5: 5× faster than competing transcription models
  • MAI-Code-1-Flash active parameters: 5B

The Full MAI Model Lineup

Model Category Highlights
MAI-Thinking-1 Reasoning / Text 35B active, 256K context, competitive with Opus 4.6 on SWE-Bench
MAI-Code-1-Flash Coding 5B active, deep GitHub Copilot / VS Code integration
MAI-Image-2.5 Image gen + edit Arena.ai #2 family, image-to-image editing supported
MAI-Image-2.5-Flash Image (lite) Faster, more cost-efficient flash variant
MAI-Transcribe-1.5 Transcription 43 languages, domain terminology, 5× speed advantage
MAI-Voice-2 Speech synthesis 15 languages, voice cloning from a short sample
MAI-Voice-2-Flash Speech (lite) Ultra-efficient low-cost variant, coming soon

MAI-Thinking-1: Microsoft's First LLM

MAI-Thinking-1 is Microsoft AI's debut large language model, purpose-built for reasoning, mathematics, and software engineering at enterprise scale. Its Mixture-of-Experts architecture keeps inference overhead lean — 35 billion active parameters out of roughly 1 trillion total — while delivering benchmark scores that punch far above its weight class.

Independent human raters on Surge preferred MAI-Thinking-1 over Sonnet 4.6 in blind side-by-side evaluations. On SWE-Bench Pro — the toughest publicly available coding benchmark — it scores 53%, matching Claude Opus 4.6. On AIME 2025 it reaches 97.0%, showing strong generalization in mathematical reasoning. With a starting price of $5 per million input tokens, it is also among the most cost-efficient options in its tier.

💡
Developer Note
MAI-Thinking-1 is available in private preview on Microsoft Foundry now; public preview on MAI Playground is coming soon. MAI-Code-1-Flash is already deeply integrated into GitHub Copilot CLI and VS Code, making it well-suited for agentic coding workflows. Models will also be available on OpenRouter, Fireworks, and Baseten — and for the first time, developers can fine-tune the weights directly.

Coding, Images, Voice — and Healthcare AI

MAI-Code-1-Flash targets the growing market for inference-efficient coding agents. At 5 billion active parameters, it is comparable to Haiku in capability but optimized specifically for Microsoft's stack, deeply embedded into GitHub Copilot and VS Code toolchains. MAI-Image-2.5 adds image-to-image editing and control-with-preservation to its text-to-image generation, landing at second place in the Arena.ai image generation leaderboard.

Beyond the model launch, Microsoft announced a partnership with Mayo Clinic to co-create a frontier AI model for healthcare, combining Mayo Clinic's clinical expertise and de-identified longitudinal data with Microsoft's foundational AI capabilities.

ℹ️
Availability
MAI-Thinking-1: private preview on Foundry now (request access). MAI-Image-2.5 and MAI-Transcribe-1.5: available directly in Foundry. MAI-Voice-2-Flash: coming soon. Developer weight fine-tuning is enabled for the first time across the MAI family.
Key Takeaways
  • Microsoft built seven frontier AI models from scratch, without distilling from third-party labs
  • MAI-Thinking-1 matches Claude Opus 4.6 on SWE-Bench Pro at a competitive $5/M-token price
  • Full multimodal coverage: reasoning, code, image generation and editing, voice, and transcription
  • Models available on OpenRouter and Fireworks; developer fine-tuning opened for the first time
  • Microsoft–Mayo Clinic healthcare AI collaboration announced alongside the model family
🔗
Official Sources & Resources
Microsoft AI: Full MAI Model Launch Blog Post
Introducing MAI-Thinking-1 — Official Announcement
MAI-Thinking-1 Model Specs & Access Request
Azure AI Foundry Blog: MAI Model Availability