Waymo taps Google Deepmind's Genie 3 to simulate driving scenarios its cars have never seen

· Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Novice, short

Summary

Waymo has introduced the Waymo World Model, a new generative simulation model for autonomous driving, built upon Google DeepMind's Genie 3. This system generates hyper-realistic and unusual traffic scenarios, such as elephants on the road, tornadoes, or flooded streets, which are rarely encountered in real-world driving. Unlike traditional simulation models that rely solely on driving data, the Waymo World Model leverages Genie 3's extensive world knowledge, acquired from a diverse video dataset, and converts it into 3D lidar outputs through specialized post-training. This allows the model to create both camera and lidar data, enhancing the Waymo Driver's preparedness for complex and unforeseen situations before public road deployment. Waymo also offers control mechanisms like driving action, scene layout, and text prompts for scenario generation, alongside a leaner version for large-scale simulations.

Key takeaway

For AI Scientists developing autonomous driving systems, the Waymo World Model demonstrates a critical shift from data-limited simulations to generative models. You should explore integrating broad world knowledge models like Genie 3 to create diverse, rare, and complex scenarios, significantly enhancing the robustness and safety validation of your autonomous drivers beyond real-world data constraints.

Key insights

Waymo's new generative simulation model, based on Genie 3, creates rare driving scenarios for autonomous vehicle training.

Principles

Method

The Waymo World Model uses Genie 3's pre-trained world knowledge, post-trained to convert 2D video data into 3D lidar outputs, generating camera and lidar data for diverse, hyper-realistic driving simulations.

In practice

Topics

Best for: Computer Vision Engineer, AI Scientist, Research Scientist, AI Engineer, AI Product Manager, Tech Journalist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.