OpenAI reveals its first AI processor: Jalapeño
Summary
OpenAI has unveiled Jalapeño, its first custom "intelligence processor" chip, developed in partnership with Broadcom. This Application-Specific Integrated Circuit (ASIC) is engineered specifically for AI inference, powering current and future large language models like ChatGPT. The announcement, made on Wednesday, comes nine months after OpenAI initiated its collaboration with Broadcom to reduce its dependence on Nvidia's limited-supply GPUs. Broadcom CEO Hock Tan states that Jalapeño's performance rivals Nvidia's Blackwell chips and Google's Tensor processing units. This move places OpenAI among other major AI companies, including Microsoft, Meta, and Amazon, which have also introduced custom AI chips for their server infrastructure. OpenAI anticipates deploying Jalapeño, described as the "first step in a multi-generation compute platform," by the end of 2026, with early tests indicating substantially better performance per watt.
Key takeaway
For AI Architects evaluating infrastructure for large language models, OpenAI's Jalapeño chip signals a growing trend towards custom ASICs. You should consider how dedicated inference hardware can reduce your reliance on general-purpose GPUs and potentially offer superior performance per watt. Begin exploring custom chip roadmaps from major providers to inform your long-term compute strategy and optimize operational costs.
Key insights
OpenAI's Jalapeño chip, an ASIC for AI inference, aims to reduce GPU reliance and boost LLM performance.
Principles
- Custom ASICs reduce reliance on general-purpose GPUs.
- AI inference chips optimize for specific model execution.
- Performance per watt is a key metric for AI hardware.
In practice
- Deploy custom chips for LLM inference.
- Evaluate ASICs for power efficiency gains.
- Diversify AI hardware suppliers.
Topics
- OpenAI
- Broadcom
- Jalapeño
- AI Inference
- Custom AI Chips
- Large Language Models
- ASICs
Best for: Investor, CTO, VP of Engineering/Data, Tech Journalist, AI Hardware Engineer, AI Architect
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Verge.