Google DeepMind has introduced Genie 3, an AI world model capable of generating explorable 3D environments in real time from a simple text prompt.
Unlike earlier versions, it supports several minutes of continuous interaction, basic visual memory, and real-time changes such as altering weather or adding characters.
The system allows users to navigate these spaces at 24 frames per second in 720p resolution, retaining object placement for about a minute.
Users can trigger events within the virtual world by typing new instructions, making Genie 3 suitable for applications ranging from education and training to video games and robotics.
Genie 3's improvements over Genie 2 include frame-by-frame generation with memory tracking and dynamic scene creation without relying on pre-built 3D assets.
However, the AI model still has limits, including the inability to replicate real-world locations with geographic accuracy and restricted interaction capabilities. Multi-agent features are still in development.
Currently offered as a limited research preview to select academics and creators, Genie 3 will be made more widely available over time.
Google DeepMind has noted that safety and responsibility remain central concerns during the gradual rollout.