Mustafa Suleyman: AI development won’t hit a wall anytime soon—here’s why

· Source: MIT Technology Review · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure, Emerging Technologies & Innovation · Depth: Intermediate, short

Summary

The "compute explosion" describes the exponential growth in computational power driving modern AI, a trend that has seen the training data for frontier AI models increase by a trillion-fold, from 10¹⁴ flops in 2010 to over 10²⁶ flops today. This growth is fueled by three converging advances: significantly faster chips like Nvidia's, which increased raw performance sevenfold in six years, and Microsoft's Maia 200; high bandwidth memory (HBM) like HBM3, which triples data bandwidth to keep processors busy; and the development of warehouse-scale supercomputers connecting hundreds of thousands of GPUs via technologies like NVLink and InfiniBand. These hardware improvements, combined with software advancements that halve compute required for fixed performance every eight months, have led to a 50x improvement in training times for language models since 2020, far exceeding Moore's Law predictions. Projections indicate another 1,000x increase in effective compute by late 2028, potentially enabling human-level AI agents and transforming industries built on cognitive work, despite significant energy demands which are being addressed by rapidly falling solar and battery costs.

Key takeaway

For AI Architects and Directors of AI/ML evaluating future infrastructure investments, recognize that the compute explosion is not slowing; your planning should account for a projected 1,000x increase in effective compute by 2028. Prioritize scalable, energy-efficient hardware and software solutions that can integrate into warehouse-scale supercomputing environments to support the development of increasingly complex, human-level AI agents.

Key insights

Exponential compute growth, driven by hardware and software advances, is rapidly accelerating AI capabilities.

Principles

In practice

Topics

Best for: AI Architect, Director of AI/ML, CTO

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by MIT Technology Review.