NVIDIA Alpamayo 2 Super: A 32B Open Reasoning VLA Model for Level 4 Robotaxi Development

At GTC Taipei, NVIDIA unveiled Alpamayo 2 Super — a 32-billion-parameter open reasoning Vision-Language-Action model for Level 4 autonomous vehicle development — alongside AlpaGym, OmniDreams, and a full AV development pipeline.

TL;DR: NVIDIA unveiled Alpamayo 2 Super at GTC Taipei — a 32B-parameter open reasoning Vision-Language-Action model targeting Level 4 robotaxi development. Alongside it, NVIDIA released AlpaGym (a closed-loop RL training framework), OmniDreams (a photorealistic AV scenario generation world model), and a CoC Auto-Labeling Pipeline, forming a complete data-to-deployment pipeline for autonomous vehicle development.

Overview: NVIDIA Goes All-In on Open AV AI

On June 1, 2026, NVIDIA took the stage at GTC Taipei to announce Alpamayo 2 Super, its most capable open reasoning model to date. This 32-billion-parameter Vision-Language-Action (VLA) model is built specifically for the full autonomous driving stack — reasoning, planning, and acting across the entire driving pipeline for safer, scalable Level 4 development.

Rather than releasing a standalone model, NVIDIA delivered a complete autonomous vehicle development ecosystem. Alpamayo 2 Super sits at the center, flanked by AlpaGym for closed-loop reinforcement learning, OmniDreams for photorealistic scenario generation, and new physical AI agent skills under NVIDIA Agent Toolkit.

Alpamayo 2 Super at a Glance

🧠 Parameters: 32 billion
🚗 Target Application: Level 4 autonomous robotaxi development
📂 Release Model: Open source (GitHub inference code + Hugging Face weights)
🗓️ Availability: Summer 2026
🔗 Architecture: Unified Vision-Language-Action (VLA) model

What Is a VLA Model — and Why Does It Matter for AVs?

A Vision-Language-Action (VLA) model unifies three capabilities in a single architecture: understanding visual input (camera feeds, sensor data), reasoning about situations in language space, and generating concrete actions (steering, braking, acceleration commands). This is a departure from the traditional modular AV stack where perception, planning, and control are separate pipelines.

Alpamayo 2 Super is trained using Chain-of-Causality (CoC) labeling — a method that grounds each driving decision in its causal context, helping the model understand why specific actions are taken in specific situations, not just that they were taken. This causal grounding is what separates genuine reasoning-based driving from pattern-matched behavior.

For Developers
Alpamayo 2 Super will be freely available this summer via GitHub (inference code) and Hugging Face (model weights). NVIDIA's physical AI agent skills under Agent Toolkit can automate simulation, synthetic data generation, and closed-loop RL training workflows — meaning teams can build and validate AV systems end-to-end without heavy manual infrastructure.

AlpaGym: Learn From Consequences Before Hitting the Road

AlpaGym is NVIDIA's open-source, high-throughput closed-loop reinforcement learning framework for autonomous driving. The key distinction from open-loop training: rather than evaluating a model against recorded data (open-loop), AlpaGym runs models through continuous decision-observation cycles where every braking, steering, and navigation choice actively affects the simulation environment.

Built on the AlpaSim microservice simulation stack and NVIDIA Omniverse NuRec, AlpaGym enables efficient, scalable closed-loop RL. It provides Alpamayo with a continuous development path — from open-loop pre-training on the Physical AI AV Dataset to closed-loop fine-tuning that pushes the frontier of driving performance before any real-world deployment.

OmniDreams: Generating the Scenarios That Don't Exist Yet

One of the hardest problems in AV development is the long tail — rare but critical driving scenarios that happen infrequently in the real world but are disproportionately dangerous. OmniDreams is NVIDIA's generative world model designed to solve this by producing photorealistic, closed-loop scenarios of these rare events at scale.

Rather than collecting millions of miles of real-world data hoping to encounter edge cases, development teams can use OmniDreams to synthetically generate and train on exactly the scenarios they need — pedestrians in fog, unexpected debris, multi-vehicle near-misses — with photorealistic fidelity that transfers to real deployment environments.

Component	Role	Key Feature
Alpamayo 2 Super	Unified VLA reasoning model	32B params, open-source
AlpaGym	Closed-loop RL training	Continuous decision-consequence cycles
OmniDreams	Photorealistic scenario generation	Long-tail and rare-event specialization
NuRec Neural Reconstruction	Real-world data → 3D scene reconstruction	Multi-sensor configuration support
CoC Auto-Labeling Pipeline	Causal label generation from raw clips	No human annotation required

CoC Auto-Labeling Pipeline
NVIDIA is also open-sourcing the CoC Auto-Labeling Pipeline on GitHub. It automatically generates decision-grounded and causally linked CoC labels from raw driving clips without human annotation — providing the foundational causal training data needed to train embodied reasoning models at scale.

Key Takeaways

Alpamayo 2 Super is NVIDIA's most powerful open model — 32B VLA reasoning architecture for Level 4 robotaxi development, coming this summer
AlpaGym closes the loop: closed-loop RL lets models learn from the consequences of their driving decisions in simulation
OmniDreams generates photorealistic long-tail scenarios at scale — no real-world mileage required
CoC Auto-Labeling Pipeline eliminates human annotation bottlenecks for causal training data
Full pipeline integrates data capture → simulation → training → in-vehicle deployment in a single ecosystem

Strategic Context: Why Open AV AI Matters

The open-source release of Alpamayo 2 Super carries significant strategic weight. Unlike the proprietary systems behind Waymo or Tesla, NVIDIA is creating a foundation that developers, researchers, and AV startups can freely build on and improve. Distribution through platforms like OpenRouter, Fireworks, and Baseten further democratizes access.

For NVIDIA, the move is a clear extension of its hardware dominance into the software AI stack. As the AV market accelerates toward Level 4 commercialization, controlling the foundational model ecosystem creates compounding advantages — teams that build on NVIDIA's AV stack naturally gravitate toward NVIDIA's Drive hardware and cloud infrastructure.

The broader industry signal: the era of black-box AV AI is giving way to open, auditable, reason-tracing models. For a sector where safety and regulatory scrutiny are paramount, that transparency may prove as commercially important as raw benchmark performance.

Related Reading · Official Sources
· NVIDIA Self-Driving (DRIVE)
· NVIDIA Developer Blog
· NVIDIA GTC