From Seeing to Simulating: Generative High-Fidelity Simulation with Digital Cousins for Generalizable Robot Learning and Evaluation

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, extended

Summary

WorldComposer is a generative framework that creates high-fidelity, interactive simulation environments for robot learning directly from real-world panoramas. It establishes a real-to-sim mapping to reconstruct "Digital Twins" and then synthesizes diverse "Digital Cousins" through semantic and geometric editing, significantly expanding data diversity. The framework supports multi-room stitching for large-scale navigation environments and integrates high-quality physics engines with a comprehensive asset library for rigid, articulated, and deformable objects. Experiments demonstrate a strong sim-to-real correlation (Pearson r=0.91) and show that policies trained with WorldComposer's augmented data achieve significantly better generalization to unseen scenes and objects, with success rates increasing from 10% to 85% with more generated data.

Key takeaway

For research scientists developing generalizable robot policies, WorldComposer offers a robust platform to overcome real-world data scarcity. You should consider integrating this framework to generate diverse "Digital Cousins" for training, as it significantly boosts generalization to unseen scenarios and provides a reliable testbed for evaluating policy performance before real-world deployment.

Key insights

WorldComposer generates diverse, high-fidelity simulation environments from real-world panoramas for robust robot learning.

Principles

Method

WorldComposer converts panoramas to editable 3D Gaussian Splatting and collision meshes, stitches multiple rooms, and populates scenes with physics-enabled assets, generating diverse "Digital Cousins" via prompt-driven editing.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Robotics Engineer, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.