Elon won after all

· Source: Theo - t3․gg · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure, Emerging Technologies & Innovation · Depth: Intermediate, extended

Summary

Major tech companies such as Microsoft, Google, and Anthropic face a severe compute shortage that is limiting their growth and revenue. Microsoft expects capacity constraints through H1 2025, while Google and Anthropic are paying SpaceX billions monthly for GPU access, even though Google designs its own TPUs. The shortage extends beyond H100 GPUs to other critical components: hard drives, where Western Digital is sold out for 2026 with prices doubling, and high-bandwidth memory (HBM), where manufacturers such as Micron are shifting production from consumer products to data center parts. Power availability is another significant bottleneck, as US grid expansion lags demand. Scaling silicon (TSMC) and HBM manufacturing takes 8-10 years, so no quick resolution is possible. SpaceX, having overbought compute, is now a major supplier, while OpenAI's early investment in compute left it well positioned. Nvidia remains the primary beneficiary of this demand.

Key takeaway

AI/ML directors and architects planning future infrastructure should treat the compute crisis as a fundamental, long-term constraint, not a temporary market fluctuation. The ability to scale AI initiatives will be directly limited by access to GPUs, HBM, storage, and power. Prioritize securing compute and the supporting infrastructure now: prices are unlikely to fall soon, and lead times for new supply-chain capacity span years.

Key insights

The global AI compute supply chain is severely bottlenecked across silicon, memory, storage, and power, while demand far outstrips supply and pushes costs to unprecedented levels.

Best for: CTO, VP of Engineering/Data, Investor, Director of AI/ML, AI Architect, Executive

Editorial summary, takeaway, and curation by AIssential. Original article published by Theo - t3․gg.