Nvidia has announced significant new infrastructure and AI models, reinforcing its commitment to building the foundational technology for physical AI. This includes robots and autonomous vehicles capable of perceiving and interacting with the real world, marking a pivotal step in accelerating research for self-driving technology and intelligent systems.
At the NeurIPS AI conference in San Diego, California, the semiconductor giant unveiled Alpamayo-R1, an open reasoning vision language model specifically designed for autonomous driving research. Nvidia highlights Alpamayo-R1 as the first vision language action model focused on self-driving applications. Vision language models are crucial for autonomous systems, as they can process both text and images, enabling vehicles to "see" their surroundings and make informed decisions based on their perceptions.
Alpamayo-R1 builds upon Nvidia's existing Cosmos Reason model, a sophisticated reasoning model that processes decisions before generating responses. The Cosmos model family was initially introduced in January 2025, with further models released in August of the same year.
Nvidia emphasizes the critical role of technologies like Alpamayo-R1 in helping companies achieve Level 4 autonomous driving. This level signifies full autonomy within defined operational design domains and under specific conditions, as detailed in a company blog post. The company aims for these reasoning models to imbue autonomous vehicles with the "common sense" needed to navigate complex driving scenarios, much like human drivers.
The new Alpamayo-R1 model is now publicly available on GitHub and Hugging Face. Complementing this release, Nvidia has also published the "Cosmos Cookbook" on GitHub. This collection includes new step-by-step guides, inference resources, and post-training workflows designed to assist developers in effectively utilizing and training Cosmos models for their specific applications. The cookbook covers essential aspects such as data curation, synthetic data generation, and model evaluation.
These announcements underscore Nvidia's aggressive expansion into physical AI, positioning it as a key growth area for its advanced AI GPUs. Jensen Huang, Nvidia's co-founder and CEO, has consistently emphasized that physical AI represents the "next wave" of artificial intelligence. This sentiment was echoed by Bill Dally, Nvidia's chief scientist, in a conversation with TechCrunch, where he highlighted the importance of physical AI in robotics.
"I think eventually robots are going to be a huge player in the world and we want to basically be making the brains of all the robots," Dally stated. "To do that, we need to start developing the key technologies."
This vision drives Nvidia's ongoing investment in developing the core technologies that will power future intelligent physical systems.







