Nvidia announced today that it is releasing a family of foundational AI models called Cosmos that can be used for training identicalindustrial robots, and self-driving carsWhile language models learn to generate text by training on abundant books, articles, and social media posts, Cosmos is designed to generate images and 3D models of the physical world.
During the keynote presentation at the annual CES conference in Las Vegas, Nvidia's CEO jensen huang Cosmos examples were shown to simulate activities inside warehouses. Cosmos was trained on 20 million hours of real footage of “humans walking, moving hands, manipulating things,” Jensen said. “It's not about producing creative content, but about teaching AI to understand the physical world.”
Researchers and startups hope that this type of fundamental model can Provide robots used in factories and homes More sophisticated capabilities. For example, Cosmos can generate realistic video footage of boxes falling from shelves inside a warehouse, which can be used to train robots to recognize accidents. Users can also improve the models using their own data.
Nvidia says several companies are already using Cosmos, including humanoid robot startups Agility and Figure AI, as well as self-driving car companies like Uber, Wabi, and Wave.
Nvidia also announced software designed to help different types of robots learn to perform new tasks more efficiently. The new feature is part of Nvidia's existing Isaac robot simulation platform that will allow robot builders to take a small number of examples of a desired task, such as grasping a particular object, and generate large amounts of synthetic training data.
Nvidia hopes Cosmos and Isaac will appeal to companies that want to build and use humanoid robots. Jensen was joined on stage at CES by life-size images of 14 different humanoid robots developed by companies including Tesla, Boston Dynamics, Agility and Figma.
Nvidia also announced with Cosmos Project Digits, a $3,000 “personal AI supercomputer” Which can run a large language model of up to 200 billion parameters without the need for cloud services like AWS or Microsoft. It also announced its highly anticipated next-generation RTX Blackwell GPUs and upcoming software tools to help build AI Agent,