AutoGrad Changed Everything (Not Transformers) [Dr. Jeff Beck]
Summary
Dr. Jeff Beck, a mathematician from Northwestern University, argues that AutoGrad, not Transformers, was the pivotal innovation driving the recent explosion in AI development. He posits that AutoGrad transformed AI from a painstaking process of manually constructing neural networks and learning rules into an engineering problem, enabling rapid experimentation with diverse architectures and the discovery of solutions to issues like vanishing gradients. Beck advocates for a shift from purely function approximation models to cognitively inspired, object-centered models that mirror how the brain and the physical world are structured. He emphasizes the importance of Bayesian inference, continuous learning, and a "lots of little models" approach, drawing parallels to video game engines, to achieve more robust, generalizable, and data-efficient AI, particularly for robotics. He also discusses the challenges of AI alignment, suggesting that current reward-based systems are problematic due to the ambiguity of reward function selection and the conflation of beliefs and values.
Key takeaway
For research scientists developing advanced AI, you should prioritize building models that incorporate explicit, object-centered causal structures, rather than relying solely on large-scale function approximation. Focus on integrating approximate Bayesian inference and continuous learning mechanisms to enhance generalization, data efficiency, and the ability to handle novel situations, moving beyond expert trajectory learning for robust real-world deployment in robotics and other complex domains.
Key insights
AutoGrad, not Transformers, fundamentally changed AI development by making it an engineering problem, enabling rapid experimentation.
Principles
- Bayesian inference offers a normative approach to empirical inquiry.
- The brain's efficiency suggests optimal cue combination and uncertainty handling.
- Causal models simplify systems and enable effective intervention.
Method
Develop AI using cognitively inspired, object-centered models, grounded in macroscopic physics, employing approximate Bayesian inference and continuous learning, akin to a video game engine's asset management.
In practice
- Utilize normalizing flows for tractable probabilistic reasoning in deep learning.
- Employ a "lots of little models" approach for data-efficient training.
- Design AI to track surprisal and query external model banks for unknown objects.
Topics
- AutoGrad
- Bayesian Inference
- Cognitively Inspired AI
- Object-Centered Models
- AI Alignment
Best for: Research Scientist, AI Researcher, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning Street Talk.