OpenAI Unveils First Custom AI Chip With Broadcom

· Source: Bloomberg Tech · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure, Emerging Technologies & Innovation · Depth: Intermediate, extended

Summary

OpenAI, in partnership with Broadcom, unveiled "Jalapeno," its first custom AI chip, aiming for 50% lower inference costs and greater infrastructure control, with plans to spend tens of billions on these chips. Simultaneously, SK Hynix plans a landmark \$29.4 billion US listing on July 10th to expand HBM memory chip capacity, capitalizing on surging AI demand and structural shifts in the memory market. Cerebras reported its first quarterly earnings post-IPO, showing record revenues of \$191 million (up 92% YoY) but a stock drop due to margin impacts from renting back gear to meet demand, highlighting its unique architecture that avoids HBM bottlenecks. Separately, SpaceX secured a record \$25 billion investment-grade bond sale, significantly lowering borrowing costs for its integrated entities, including XAI, and Andreessen Horowitz discussed its \$10 billion-plus investment in SpaceX, emphasizing the long-term vision for Starship and potential orbital data centers.

Key takeaway

For Directors of AI/ML evaluating compute strategies, this brief highlights the imperative to diversify hardware beyond traditional GPUs. You should explore custom ASICs like OpenAI's Jalapeno or alternative architectures such as Cerebras's wafer-scale approach to mitigate supply chain bottlenecks and reduce inference costs. Additionally, consider the structural shifts in memory demand, which necessitate securing HBM supply or adopting HBM-free solutions to ensure long-term capacity.

Key insights

Custom AI hardware and strategic capital are critical for scaling AI infrastructure and managing costs.

Principles

Method

Cerebras's wafer-scale architecture avoids HBM, CoWoS, and 3nm node constraints, enabling faster compute deployment for customers.

In practice

Topics

Best for: CTO, VP of Engineering/Data, AI Product Manager, Investor, Director of AI/ML, Consultant

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Bloomberg Tech.