Learning to Theorize the World from Observation
Summary
The Learning-to-Theorize (L2T) paradigm introduces a novel approach to AI understanding, moving beyond mere future prediction to infer explicit explanatory theories from raw, non-textual observations. Inspired by developmental cognitive science, this framework posits that true understanding emerges from constructing internal theories of how the world operates. The Neural Theorizer (NEO) instantiates L2T as a probabilistic neural model that induces latent, executable, and compositional programs, representing a learned "Language of Thought." NEO executes these programs through a shared transition model, enabling explanation-driven generalization. Experiments demonstrate its ability to transfer inferred programs, recombine primitives into unseen compositions, and generalize to programs longer than those observed during training, addressing limitations of traditional world models.
Key takeaway
For AI and Research Scientists developing robust world models, this work suggests shifting focus from optimizing future prediction to inducing explicit, compositional theories. Your systems should aim to discover reusable abstract primitives and learn how to compose them into structured explanations. Implementing mechanisms like NEO's latent program induction, shared execution model, and Minimum Description Length principle can significantly improve compositional and length generalization, crucial for understanding novel phenomena and out-of-distribution scenarios.
Key insights
Understanding the world means inferring explicit, compositional programs from raw observations, not just predicting future states.
Principles
- Human-like understanding requires theory-building.
- Theories are executable, compositional programs.
- Learning should target reusable explanatory structures.
Method
NEO infers latent programs by iteratively selecting primitives via a goal-conditioned policy and executing them through a shared transition model, optimized with an MDL principle for program length.
In practice
- Train on minimally curated observational data.
- Evaluate by program transferability, not just reconstruction.
- Use state grounding to enforce valid intermediate states.
Topics
- Learning-to-Theorize
- Neural Theorizer
- Program Induction
- Compositional Generalization
- World Models
- Language of Thought
Best for: AI Scientist, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.