Google Releases Two New AI Chips

· Source: AutoGPT · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure · Depth: Novice, short

Summary

Google released two new custom AI chips, the TPU 8t and TPU 8i, on April 22, 2026, marking the eighth generation of its Tensor Processing Units. The TPU 8t is designed for AI model training, while the TPU 8i handles inference, a specialized approach intended to optimize performance and reduce costs. Google claims these new TPUs offer up to 3x faster AI model training and 80% better performance per dollar compared to the previous generation. Furthermore, over one million TPUs can now operate together in a single cluster, providing immense computing power. Google is also collaborating with Nvidia to enhance networking software, specifically upgrading the Falcon tool, to improve the efficiency of Nvidia-based systems within Google's cloud infrastructure.

Key takeaway

For CTOs and VPs of Engineering evaluating cloud AI infrastructure, Google's new TPU 8t and 8i chips present a compelling option for optimizing AI training and inference workloads. You should assess the performance-per-dollar improvements and the scalability of these specialized TPUs, especially for large-scale model deployments. Consider how this focused chip architecture could reduce your operational costs and accelerate development cycles compared to general-purpose hardware.

Key insights

Google's new TPU 8t and 8i chips specialize in AI training and inference, aiming for efficiency and cost reduction.

Principles

Method

Google splits AI tasks into dedicated chips: TPU 8t for training and TPU 8i for inference, allowing for fine-tuning and improved performance per dollar.

In practice

Topics

Best for: CTO, VP of Engineering/Data, MLOps Engineer, AI Hardware Engineer, AI Architect, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by AutoGPT.