Slot-MPC: Goal-Conditioned Model Predictive Control with Object-Centric Representations

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, quick

Summary

Slot-MPC is a novel object-centric world modeling framework designed for goal-conditioned Model Predictive Control (MPC) in robotic manipulation tasks. It utilizes vision encoders to learn slot-based representations, which individually encode objects within a scene. These structured representations then facilitate the learning of an action-conditioned object-centric dynamics model. During inference, this learned dynamics model enables action planning via MPC, allowing agents to adapt to novel situations. The framework employs gradient-based MPC for action optimization, which offers greater computational efficiency compared to gradient-free, sampling-based MPC methods. Experimental results on simulated robotic manipulation tasks demonstrate that Slot-MPC enhances both task performance and planning efficiency when compared to non-object-centric world model baselines.

Key takeaway

For research scientists developing robotic manipulation systems, Slot-MPC offers a path to more generalizable and efficient action planning. You should consider integrating object-centric world models with gradient-based MPC to improve performance in novel situations, especially when working with limited state-action coverage in offline settings.

Key insights

Slot-MPC combines object-centric representations with gradient-based MPC for efficient, generalizable robotic action planning.

Principles

Method

Slot-MPC learns slot-based object representations via vision encoders, then an action-conditioned dynamics model, enabling gradient-based MPC for planning and adaptation.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Robotics Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.