Google is building its own 'world modeling' AI team for games and robot training
Google DeepMind is creating a team to develop AI "world models" for simulating environments, enhancing video games and robot training, while competing with OpenAI and Nvidia in the AGI race.
Read original articleGoogle DeepMind is establishing a new team focused on developing "world models," which are AI systems designed to simulate physical environments. This initiative, led by Tim Brooks, a former co-lead of OpenAI’s Sora project, aims to create AI models that can enhance video games, movies, and robot training scenarios. The team will work on scaling pretraining using video and multimodal data, which DeepMind believes is essential for achieving artificial general intelligence (AGI). The project is part of a broader competitive landscape in AI, where companies like OpenAI and Nvidia are also advancing their own technologies. DeepMind's world models will integrate with existing projects, including the Gemini AI models and the Veo video generator, to support applications in visual reasoning, planning for robots, and real-time interactive entertainment. As the race for AGI intensifies, Google’s focus on world modeling reflects its ambition to lead in this emerging field.
- Google DeepMind is forming a team to develop AI "world models" for simulating physical environments.
- The initiative is led by Tim Brooks, previously of OpenAI, and aims to enhance video games and robot training.
- DeepMind believes scaling pretraining on multimodal data is crucial for achieving artificial general intelligence (AGI).
- The project will complement existing Google AI efforts, including Gemini and Veo.
- The competition for AGI is intensifying, with other companies like OpenAI and Nvidia also making significant advancements.
Related
Gemini Pro 1.5 experimental "version 0801" available for early testing
Google DeepMind's Gemini family of AI models, particularly Gemini 1.5 Pro, excels in multimodal understanding and complex tasks, featuring a two million token context window and improved performance in various benchmarks.
A co-lead on Sora, OpenAI's video generator, has left for Google
Tim Brooks has left OpenAI to join Google DeepMind, focusing on video generation technologies. His departure follows a trend of high-profile resignations from OpenAI amid challenges faced by the Sora project.
Pushing the Frontiers of Audio Generation
Google DeepMind has advanced audio generation technology, enabling natural digital interactions and long-form dialogues. Their latest model improves efficiency and quality while emphasizing responsible AI development and future integration with other media.
Demis Hassabis:'We will need a handful of breakthroughs before we reach AGI'
Demis Hassabis, CEO of Google DeepMind, believes achieving artificial general intelligence requires multiple breakthroughs. He emphasizes AI's benefits in scientific research, particularly through AlphaFold, despite environmental concerns.
Genie 2: A large-scale foundation world model
Google DeepMind's Genie 2 is a foundation model that creates diverse 3D environments from a single image, enhancing AI training and research towards general artificial intelligence through complex interactions and memory capabilities.
Related
Gemini Pro 1.5 experimental "version 0801" available for early testing
Google DeepMind's Gemini family of AI models, particularly Gemini 1.5 Pro, excels in multimodal understanding and complex tasks, featuring a two million token context window and improved performance in various benchmarks.
A co-lead on Sora, OpenAI's video generator, has left for Google
Tim Brooks has left OpenAI to join Google DeepMind, focusing on video generation technologies. His departure follows a trend of high-profile resignations from OpenAI amid challenges faced by the Sora project.
Pushing the Frontiers of Audio Generation
Google DeepMind has advanced audio generation technology, enabling natural digital interactions and long-form dialogues. Their latest model improves efficiency and quality while emphasizing responsible AI development and future integration with other media.
Demis Hassabis:'We will need a handful of breakthroughs before we reach AGI'
Demis Hassabis, CEO of Google DeepMind, believes achieving artificial general intelligence requires multiple breakthroughs. He emphasizes AI's benefits in scientific research, particularly through AlphaFold, despite environmental concerns.
Genie 2: A large-scale foundation world model
Google DeepMind's Genie 2 is a foundation model that creates diverse 3D environments from a single image, enhancing AI training and research towards general artificial intelligence through complex interactions and memory capabilities.