AGWM: Affordance-Grounded World Models for Environments with Compositional Prerequisites

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, extended

Summary

The Affordance-Grounded World Model (AGWM) addresses limitations in standard world models that fail to track dynamic action executability in environments with compositional prerequisites, termed Structure-Changing (SC) events. Conventional models often internalize action-outcome correlations without considering preconditions, leading to compounding prediction errors in multi-step rollouts and poor generalization to novel configurations. AGWM learns an abstract affordance structure, represented as a Directed Acyclic Graph (DAG) of prerequisite dependencies, to explicitly track the dynamic executability of actions. It incorporates an SC Classifier to detect structure changes and a Dynamic Affordance Graph that enforces a frontier-mask constraint, ensuring only feasible actions are considered during imagination. Experiments in game-based simulated environments demonstrate AGWM's effectiveness, achieving lower multi-step prediction error, better generalization to novel configurations, and improved interpretability compared to baseline models.

Key takeaway

For research scientists developing model-based reinforcement learning agents in compositional domains, AGWM offers a robust solution to the challenge of dynamic action executability. By explicitly modeling affordance structures and their evolution, you can significantly reduce multi-step prediction errors and enhance generalization to unseen rule combinations. Consider integrating AGWM's self-evolving graph and frontier-mask constraint to improve the safety and reliability of your autonomous agents by ensuring imagined trajectories adhere to feasible action sequences.

Key insights

AGWM explicitly tracks dynamic action executability via a self-evolving affordance graph, reducing prediction errors in complex environments.

Principles

Method

AGWM augments a recurrent world model with an SC Classifier and a Graph Predictor, using a Graph Encoder to embed affordance structure into the GRU input and decoder, and enforcing a frontier-mask constraint.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.