Decart lays the foundation for physical AI systems with Oasis 3

· Source: Dataconomy · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Emerging Technologies & Innovation · Depth: Advanced, medium

Summary

Decart, an AI startup, has released Oasis 3, the third generation of its world model series, designed to lay the foundation for "physical AI" systems like autonomous robots and vehicles. This release follows Decart's recent \$300 million funding round from investors including Nvidia and Toyota. Oasis 3 significantly advances world models by introducing real-time, action-conditioned video generation, enabling hyper-realistic simulations crucial for scaling robotic reinforcement learning. Accessible via a live API, it generates "endless" 3D worlds at 22 frames per second and 768px resolution, with less than 200 milliseconds latency. Key improvements include accurate physics simulation and multiview camera synchronization, providing three perfectly aligned camera angles for depth perception. This infrastructure supports training for diverse applications, from self-driving cars navigating complex scenarios to humanoid robots developing fine motor skills.

Key takeaway

For Robotics Engineers developing autonomous systems, Oasis 3 fundamentally changes how you approach training. You can now generate infinite, hyper-realistic, physics-based environments with real-time feedback and multiview camera synchronization, eliminating the need for costly real-world scenario recreation. This allows you to rapidly scale reinforcement learning for self-driving cars, industrial drones, or humanoid robots, accelerating deployment and reducing development cycles.

Key insights

Oasis 3 provides hyper-realistic, real-time simulation environments critical for training autonomous physical AI systems.

Principles

Method

Oasis 3 uses real-time, action-conditioned video generation to create "endless" 3D worlds, integrating advancements from previous Oasis versions and Lucy video models for physics-based environments with low latency and multiview camera synchronization.

In practice

Topics

Best for: Computer Vision Engineer, AI Scientist, Research Scientist, Robotics Engineer, Machine Learning Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Dataconomy.