BRICKS-WM: Building Reusability via Interface Composition Kinetics for Structured World Models

2026-06-15 · Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, quick

Summary

BRICKS-WM (Building Reusability via Interface Composition Kinetics for Structured World Models) is a novel framework designed to enhance reusability in Model-based Reinforcement Learning (MBRL). It addresses the limitation of prevailing MBRL approaches that use monolithic latent dynamics, which tightly couple environment dynamics and necessitate retraining the entire world model when an agent changes. BRICKS-WM posits that global dynamics can be modeled as a composition of independent dynamical modules interacting through learned latent interfaces. As a minimal example, it factorizes the latent state into an actuated Agent module and an external Background module, connected by a learned interface. This approach enforces a functional separation in transition dynamics, ensuring the background dynamics remain independent of the agent's dynamics. Empirically, BRICKS-WM achieves control performance comparable to strong monolithic baselines when trained from scratch, and critically, enables the reuse of frozen background dynamics across different agents.

Key takeaway

For Machine Learning Engineers developing Model-based Reinforcement Learning systems, BRICKS-WM offers a crucial architectural shift. If your projects involve iterating on agents or transferring them across similar environments, adopting a modular world model like BRICKS-WM can drastically reduce retraining time and computational expense. You can reuse frozen background dynamics, accelerating development cycles and making MBRL more practical for complex, evolving applications. Evaluate its functional separation approach to improve system reusability.

Key insights

BRICKS-WM enables modular, reusable world models in MBRL by functionally separating agent and environment dynamics.

Principles

Global dynamics can be composed of distinct interacting modules.
Functional separation in transition dynamics enhances reusability.
Background dynamics should be agnostic to agent dynamics.

Method

BRICKS-WM factors the latent state space into an actuated Agent module and an external Background module, bridged by a learned latent interface, enforcing functional separation.

In practice

Reuse frozen background dynamics across different agents.
Avoid retraining entire world models for agent changes.

Topics

Model-based Reinforcement Learning
World Models
Modular AI
Latent Dynamics
Agent-Environment Interaction
Reusability

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Robotics Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.