Google plans nearly two million new AI chips as it turns to Marvell for custom designs
Summary
Google is reportedly collaborating with Marvell Technology to develop two new specialized AI chips for its data centers, aiming to produce nearly two million units. One chip is a memory processing unit designed to complement Google's existing Tensor Processing Units (TPUs) by handling AI tasks based on memory and compute requirements. The second chip is a new TPU specifically optimized for AI inference, which involves running finished AI models. This strategic move, with designs expected to be finalized by next year, also seeks to reduce Google's reliance on its current chip design partner, Broadcom, despite Broadcom having a contract with Google through 2031.
Key takeaway
For CTOs and VPs of Engineering evaluating AI infrastructure, Google's shift towards custom Marvell-designed chips for inference and memory processing signals a critical trend. You should assess your organization's reliance on single-vendor AI hardware and explore custom silicon partnerships to optimize performance and mitigate supply chain risks. Consider how specialized inference hardware could impact your operational costs and model deployment strategies.
Key insights
Google is developing custom AI chips with Marvell to enhance inference capabilities and diversify its supply chain.
Principles
- Custom silicon optimizes AI workload performance.
- Diversifying chip partners reduces vendor dependency.
In practice
- Integrate memory processing units with TPUs.
- Develop TPUs specialized for inference tasks.
Topics
- Google AI Chips
- Marvell Technology
- Tensor Processing Units
- AI Inference
- Memory Processing Units
Best for: Investor, CTO, VP of Engineering/Data, AI Hardware Engineer, AI Architect, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.