Oracle Cloud Delivers AI Instances on Intel® Xeon® 6 Processors and NVIDIA RTX PRO Blackwell GPUs
Summary
Oracle Cloud Infrastructure (OCI) has launched new AI instances, including OCI Compute with NVIDIA RTX PRO, powered by Intel Xeon 6 CPUs and NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs. This solution addresses challenges in scaling multi-modal AI applications by consolidating AI, visualization, and simulation workloads onto a single system, reducing costs and complexity. Intel Xeon 6 processors enhance performance by eliminating CPU bottlenecks, ensuring full GPU utilization, and providing ~33%-37% higher memory bandwidth with up to 12 memory channels and DDR5/MRDIMM support. They also feature Intel Advanced Matrix Extensions (Intel AMX) for AI inference acceleration and offer 2x higher core density. Additionally, OCI provides instances with NVIDIA HGX B300 powered by 5th Gen Intel Xeon processors for large-scale training of trillion-parameter models, large-scale inference, Agentic AI, and RAG pipelines, which also include Intel Trust Domain Extensions (Intel TDX) for confidential computing.
Key takeaway
For AI Architects and ML Engineers designing multi-modal AI infrastructure, OCI's new instances with Intel Xeon 6 and NVIDIA RTX PRO Blackwell GPUs offer significant performance and cost benefits. You can consolidate diverse workloads like inference, rendering, and simulation onto a single platform, improving GPU utilization and reducing infrastructure complexity. Consider these offerings to accelerate your AI efforts from pilot to production, especially for latency-sensitive applications and large-scale training.
Key insights
OCI's new AI instances, powered by Intel Xeon 6 and NVIDIA GPUs, optimize multi-modal AI by eliminating CPU bottlenecks and enhancing resource utilization.
Principles
- CPU performance is critical for GPU utilization.
- Heterogeneous computing improves AI workflow efficiency.
- Consolidate workloads to reduce infrastructure complexity.
In practice
- Utilize Intel AMX for CPU-based AI preprocessing.
- Deploy OCI RTX PRO for generative AI and visual computing.
- Leverage Intel TDX for confidential AI workloads.
Topics
- Oracle Cloud Infrastructure
- Intel Xeon 6 Processors
- NVIDIA RTX PRO Blackwell
- Multi-modal AI
- Confidential Computing
- GPU Acceleration
Best for: MLOps Engineer, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, AI Architect
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence (AI) articles.