Google Cloud launches two new AI chips to compete with Nvidia

· Source: TechCrunch · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure, Emerging Technologies & Innovation · Depth: Intermediate, quick

Summary

Google Cloud has unveiled its eighth generation of custom-built AI chips, Tensor Processing Units (TPUs), which are now specialized into two distinct types: the TPU 8t for model training and the TPU 8i for inference. These new TPUs offer significant performance improvements over previous generations, including up to 3x faster AI model training and 80% better performance per dollar. They also support massive scalability, allowing over 1 million TPUs to operate together in a single cluster, aiming to deliver more compute power with reduced energy consumption and cost. Despite these advancements, Google Cloud continues to integrate Nvidia's latest chips, such as the Vera Rubin, into its infrastructure and is collaborating with Nvidia to enhance networking efficiency for Nvidia-based systems using Google's Falcon technology.

Key takeaway

For CTOs and VP of Engineering evaluating cloud AI infrastructure, Google Cloud's new TPU 8t and 8i offer compelling performance and cost efficiencies for both training and inference. Your teams should consider these specialized TPUs for demanding AI workloads, while also noting Google's continued integration of Nvidia's latest chips and collaborative networking enhancements, which could further optimize hybrid AI deployments.

Key insights

Google Cloud's new specialized TPUs enhance AI training and inference while complementing, not replacing, Nvidia's hardware.

Principles

Method

Google is enhancing cloud AI infrastructure by developing specialized TPUs for training and inference, and by collaborating with Nvidia on networking improvements like Falcon.

In practice

Topics

Best for: CTO, VP of Engineering/Data, MLOps Engineer, AI Architect, Director of AI/ML, Tech Journalist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by TechCrunch.