OccSim: Multi-kilometer Simulation with Long-horizon Occupancy World Models

· Source: cs.CV updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, extended

Summary

OccSim is presented as the first occupancy world model-driven 3D simulator designed to overcome the limitations of data-driven autonomous driving simulations that rely on pre-recorded driving logs or HD maps. It can generate over 3,000 continuous frames, constructing large-scale 3D occupancy maps spanning over 4 kilometers from a single initial frame and future ego-actions, representing an >80x improvement in stable generation length. OccSim utilizes two modules: W-DiT for ultra-long-horizon static environment generation by incorporating rigid transformations, and a Layout Generator for populating dynamic foregrounds with reactive agents based on synthesized road topology. Data collected from OccSim can pre-train 4D semantic occupancy forecasting models, achieving up to 67% zero-shot performance on unseen data, outperforming previous asset-based simulators by 11%. Scaling the dataset 5x increases zero-shot performance to 74%, expanding the improvement to 22.1%.

Key takeaway

For research scientists developing autonomous driving systems, OccSim offers a novel approach to generate high-fidelity, large-scale simulation data without reliance on traditional maps or logs. You should consider integrating OccSim's generated datasets for pre-training 4D semantic occupancy forecasting models, as it significantly improves zero-shot performance compared to asset-based simulators, enabling more robust model development.

Key insights

OccSim is an occupancy world model-driven 3D simulator enabling multi-kilometer, log-independent autonomous driving simulations.

Principles

Method

OccSim uses a W-DiT module for static scene generation with rigid transformations and a Layout Generator for dynamic agent population, followed by A* routing and 2D-IDM for agent control.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Robotics Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.