AMD and Red Hat target enterprise AI costs with broader compute choice
Summary
AMD and Red Hat are partnering to address the escalating costs of enterprise AI adoption by offering a broader range of compute choices. John Hampton, AMD's corporate VP of global enterprise technical sales, highlighted at Red Hat Summit 2026 that organizations are struggling with "tokenomics" – the cumulative cost of AI queries and agentic workloads. The collaboration aims to provide an open, full-spectrum compute portfolio, including AMD EPYC CPUs, cost-effective GPUs like the new AMD Instinct MI350P, and high-end accelerators, all supported by Red Hat's open software stack. This approach allows enterprises to match AI workloads to the most optimal and affordable infrastructure, reducing total cost of ownership and freeing up budget and power for AI initiatives. The goal is to simplify AI deployment across hybrid environments and move beyond expensive, one-size-fits-all GPU clusters.
Key takeaway
For CTOs and MLOps Engineers grappling with exploding AI inference costs, your strategy must prioritize "AI choice" to optimize spending. Evaluate your AI workloads to determine if they truly require high-end GPUs or if more cost-effective CPUs or lower-power GPUs, like AMD's MI350P, can suffice. Conduct proofs of concept with partners like AMD and Red Hat to assess the financial and technological impact of tailored compute solutions before making large-scale infrastructure investments.
Key insights
Matching AI workloads to optimal compute resources is crucial for controlling escalating enterprise AI costs.
Principles
- AI choice reduces total cost of ownership.
- Open ecosystems simplify complex AI deployments.
Method
Assess AI use cases to map them to the most optimal compute solutions across CPUs, cost-effective GPUs, and high-end accelerators, leveraging an open software stack for hybrid environments.
In practice
- Utilize AMD Instinct MI350P for cost-effective inferencing.
- Consolidate servers with AMD EPYC CPUs and Red Hat virtualization.
Topics
- Enterprise AI Costs
- AI Compute Choice
- Tokenomics
- AMD Instinct MI350P
- Red Hat AI
Best for: CTO, Executive, MLOps Engineer, Director of AI/ML, AI Architect, VP of Engineering/Data
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by AI – SiliconANGLE.