Learning to Theorize the World from Observation

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, long

Summary

The Learning-to-Theorize (L2T) paradigm introduces a novel approach to AI understanding, moving beyond mere future prediction to infer explicit explanatory theories from raw, non-textual observations. Inspired by developmental cognitive science, this framework posits that true understanding emerges from constructing internal theories of how the world operates. The Neural Theorizer (NEO) instantiates L2T as a probabilistic neural model that induces latent, executable, and compositional programs, representing a learned "Language of Thought." NEO executes these programs through a shared transition model, enabling explanation-driven generalization. Experiments demonstrate its ability to transfer inferred programs, recombine primitives into unseen compositions, and generalize to programs longer than those observed during training, addressing limitations of traditional world models.

Key takeaway

For AI and Research Scientists developing robust world models, this work suggests shifting focus from optimizing future prediction to inducing explicit, compositional theories. Your systems should aim to discover reusable abstract primitives and learn how to compose them into structured explanations. Implementing mechanisms like NEO's latent program induction, shared execution model, and Minimum Description Length principle can significantly improve compositional and length generalization, crucial for understanding novel phenomena and out-of-distribution scenarios.

Key insights

Understanding the world means inferring explicit, compositional programs from raw observations, not just predicting future states.

Principles

Method

NEO infers latent programs by iteratively selecting primitives via a goal-conditioned policy and executing them through a shared transition model, optimized with an MDL principle for program length.

In practice

Topics

Best for: AI Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.