If You Can't See Inside, How Do You Know It's THINKING? [Dr. Jeff Beck]
Summary
Dr. Jeff Beck and an interviewer discuss various advanced AI concepts, beginning with geometric deep learning's role in modeling the physical world by incorporating symmetries like translation and rotation invariance. The conversation then shifts to defining and measuring "agency" in AI, distinguishing between a simple computational agent and one exhibiting planning, counterfactual reasoning, and goal-oriented behavior. They explore the challenge of identifying true planning from sophisticated policy execution, suggesting that an agent requires physical embodiment and a degree of internal sophistication. The discussion transitions to energy-based models (EBMs), highlighting their advantage over traditional function approximation by incorporating inductive priors and internal state optimization, with Variational Autoencoders (VAEs) cited as a canonical example. They also touch upon test-time training, transduction, and the relationship between EBMs and Bayesian inference, including Joint Embedding Prediction Architectures (JEPAs) and self-supervised learning's role in maintaining rich representations.
Key takeaway
For AI researchers and scientists developing advanced models, consider integrating geometric deep learning principles to leverage physical symmetries, which can enhance model robustness. When designing intelligent agents, focus on architectures that enable explicit planning and counterfactual reasoning, rather than merely sophisticated policy execution, and explore energy-based models for their inherent inductive priors and dual optimization of internal states and outputs. Your approach to defining and measuring agency should account for internal computational sophistication and potential physical embodiment.
Key insights
Defining AI agency requires observing internal planning, not just sophisticated external behavior, and often implies physical embodiment.
Principles
- Models of the physical world should incorporate existing symmetries.
- Agency is a matter of degree, not a binary state.
- Science prioritizes prediction and data compression.
Method
Energy-based models (EBMs) optimize both prediction error and internal states, effectively placing constraints on input-output relationships, often exemplified by VAEs that regularize internal representations.
In practice
- Pre-process neural datasets with PCA and VAEs to assess signal-to-noise.
- Use self-supervised learning to avoid discarding task-irrelevant information.
Topics
- AI Agency
- Geometric Deep Learning
- Energy-Based Models
- Self-Supervised Learning
- AI Safety
Best for: AI Researcher, AI Scientist, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning Street Talk.