From Seeing to Simulating: Generative High-Fidelity Simulation with Digital Cousins for Generalizable Robot Learning and Evaluation
Summary
WorldComposer is a generative framework that creates high-fidelity, interactive simulation environments for robot learning directly from real-world panoramas. It establishes a real-to-sim mapping to reconstruct "Digital Twins" and then synthesizes diverse "Digital Cousins" through semantic and geometric editing, significantly expanding data diversity. The framework supports multi-room stitching for large-scale navigation environments and integrates high-quality physics engines with a comprehensive asset library for rigid, articulated, and deformable objects. Experiments demonstrate a strong sim-to-real correlation (Pearson r=0.91) and show that policies trained with WorldComposer's augmented data achieve significantly better generalization to unseen scenes and objects, with success rates increasing from 10% to 85% with more generated data.
Key takeaway
For research scientists developing generalizable robot policies, WorldComposer offers a robust platform to overcome real-world data scarcity. You should consider integrating this framework to generate diverse "Digital Cousins" for training, as it significantly boosts generalization to unseen scenarios and provides a reliable testbed for evaluating policy performance before real-world deployment.
Key insights
WorldComposer generates diverse, high-fidelity simulation environments from real-world panoramas for robust robot learning.
Principles
- Diverse data improves robot policy generalization.
- Simulation fidelity correlates with real-world performance.
- Generative models bridge real-to-sim gaps efficiently.
Method
WorldComposer converts panoramas to editable 3D Gaussian Splatting and collision meshes, stitches multiple rooms, and populates scenes with physics-enabled assets, generating diverse "Digital Cousins" via prompt-driven editing.
In practice
- Use prompt-driven editing for scene variations.
- Integrate 3DGS with physics engines for interactive simulations.
- Employ multi-room stitching for large-scale navigation tasks.
Topics
- Generative Simulation
- Digital Cousins
- Robot Learning
- Sim-to-Real Transfer
- 3D Gaussian Splatting
Code references
Best for: Research Scientist, AI Scientist, Robotics Engineer, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.