PhysEditWorld: A Large-Scale Dataset Toward Physics-Editable World Models

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Computer Vision · Depth: Expert, quick

Summary

PhysEditWorld is a new large-scale multimodal dataset designed to enable physics-editable world models, addressing the limitation of current game world models that learn physical dynamics implicitly. This initial version primarily focuses on gravity, utilizing a UE5 replay-and-rendering pipeline. The dataset records normalized action traces and replays them under multiple gravity configurations, ensuring controlled physical variation. PhysEditWorld comprises 12 cinematic UE5 scenes, over 100 hours of gameplay interactions, and more than 60 million rendered rollout frames. Each sample provides synchronized multimodal signals including RGB, depth, normals, audio, action traces, camera trajectory, engine states, semantic annotations, and explicit gravity labels, supporting research into controllable world modeling.

Key takeaway

For AI Scientists and Machine Learning Engineers developing game AI or simulation environments, PhysEditWorld offers a critical resource. Your models can move beyond implicit physical correlations by training on this dataset, which provides explicit gravity labels and controlled physical variations. This enables you to build more robust, physics-faithful dynamics models and enhance consistency when implementing physical edits in authored game environments.

Key insights

PhysEditWorld is a multimodal dataset enabling physics-editable world models through controlled physical parameter variation.

Principles

Method

PhysEditWorld uses a UE5 replay-and-rendering pipeline to record action traces and replay them with varied physical parameters, like gravity, from the same initial state, character, and camera policy.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.