How robots learn: A brief, contemporary history
Summary
The robotics industry is experiencing a significant boom, with companies and investors injecting $6.1 billion into humanoid robots in 2025, quadrupling 2024 investments. This surge is driven by a paradigm shift in how robots learn, moving from rigid, rule-based programming to AI-driven methods. Early approaches, like OpenAI's Dactyl in 2018, utilized trial-and-error in simulations with "domain randomization" to adapt to real-world variations, though this method has limitations. The advent of large language models (LLMs) like ChatGPT in 2022 further catalyzed this shift, enabling robots to process vast amounts of data, including images and sensor readings, to predict actions. Google DeepMind's RT-2, released in 2023, trained on general internet images to enhance object interpretation, while Covariant's RFM-1, launched in 2024, allows for natural language interaction in warehouse settings. Companies like Agility Robotics are deploying humanoid robots, such as Digit, for tasks like moving shipping totes, demonstrating practical applications of these advanced learning techniques.
Key takeaway
For robotics engineers and product managers developing next-generation autonomous systems, you should prioritize integrating large AI models capable of processing diverse data types. Focus on iterative deployment to gather real-world data, which is crucial for refining robot performance and adaptability. This approach will enable more robust and versatile robots, moving beyond scripted actions to genuinely adaptive and interactive capabilities, despite the inherent risks of AI-generated actions.
Key insights
AI models and large datasets are revolutionizing robot learning, enabling adaptation and complex task execution.
Principles
- Simulated trial-and-error accelerates robot learning.
- Domain randomization improves real-world transferability.
- Large language models enhance robot perception and action.
Method
Robots learn by processing vast datasets (text, images, sensor readings) via AI models to predict next actions, often refined through simulated trial-and-error and real-world deployment for continuous learning.
In practice
- Use simulation with domain randomization for training.
- Integrate vision-language models for object interpretation.
- Deploy robots for data collection in target environments.
Topics
- Humanoid Robots
- Large Language Models
- Simulation Training
- Foundation Models
- Domain Randomization
Best for: Research Scientist, Investor, Entrepreneur, Robotics Engineer, AI Scientist, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by MIT Technology Review.