OccSim: Multi-kilometer Simulation with Long-horizon Occupancy World Models
Summary
OccSim is presented as the first occupancy world model-driven 3D simulator designed to overcome the limitations of data-driven autonomous driving simulations that rely on pre-recorded driving logs or HD maps. It can generate over 3,000 continuous frames, constructing large-scale 3D occupancy maps spanning over 4 kilometers from a single initial frame and future ego-actions, representing an >80x improvement in stable generation length. OccSim utilizes two modules: W-DiT for ultra-long-horizon static environment generation by incorporating rigid transformations, and a Layout Generator for populating dynamic foregrounds with reactive agents based on synthesized road topology. Data collected from OccSim can pre-train 4D semantic occupancy forecasting models, achieving up to 67% zero-shot performance on unseen data, outperforming previous asset-based simulators by 11%. Scaling the dataset 5x increases zero-shot performance to 74%, expanding the improvement to 22.1%.
Key takeaway
For research scientists developing autonomous driving systems, OccSim offers a novel approach to generate high-fidelity, large-scale simulation data without reliance on traditional maps or logs. You should consider integrating OccSim's generated datasets for pre-training 4D semantic occupancy forecasting models, as it significantly improves zero-shot performance compared to asset-based simulators, enabling more robust model development.
Key insights
OccSim is an occupancy world model-driven 3D simulator enabling multi-kilometer, log-independent autonomous driving simulations.
Principles
- Decouple static and dynamic voxel generation.
- Explicitly use 3D geometric priors for stability.
- Augment training data to prevent overfitting.
Method
OccSim uses a W-DiT module for static scene generation with rigid transformations and a Layout Generator for dynamic agent population, followed by A* routing and 2D-IDM for agent control.
In practice
- Pre-train 4D semantic occupancy models with OccSim data.
- Generate diverse, multi-kilometer road networks.
- Simulate reactive agents without historical logs.
Topics
- Occupancy World Models
- Autonomous Driving Simulation
- W-DiT Architecture
- Long-horizon Generation
- 4D Semantic Occupancy Forecasting
Code references
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Robotics Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.