Jensen Huang and Satya Nadella's Conversation at Microsoft Build
Summary
Jensen Huang and Satya Nadella discussed the evolving NVIDIA-Microsoft AI partnership, focusing on edge AI and cloud innovation. They highlighted the transformation of PCs into "personal AIs" via systems like RTX Spark, which offers a petaflop of AI performance and 128 GB of memory to run hundreds of billions of parameter models. On the cloud side, their collaboration built the first AI supercomputers, evolving from Ampere and Hopper for pre-training to Grace Blackwell for post-training and inference, achieving a 30x token generation rate increase over Hopper. The upcoming Vera Rubin system is designed specifically for agentic AIs, emphasizing extremely low latency and confidential computing with encrypted data in transit and use. This partnership also extends to accelerating all Azure tools, including Fabric, SQL, and Spark, with GPUs to meet the surging demand for agentic compute.
Key takeaway
For CTOs and AI/ML Directors evaluating future AI infrastructure, NVIDIA and Microsoft's deep collaboration signals a critical shift towards agentic systems on both edge and cloud. Your strategy should account for personal AI PCs like RTX Spark and cloud systems like Vera Rubin, which prioritize low-latency, confidential computing, and accelerated tools. Plan for integrated hardware-software stacks to maximize agent performance and cost-efficiency, ensuring your deployments are ready for the next generation of autonomous AI.
Key insights
NVIDIA and Microsoft are transforming PCs into personal AIs and scaling cloud agentic systems through deep collaboration.
Principles
- AI is now useful due to agentic systems and powerful models.
- The entire rack can function as one computer via NVLink 72.
- CPUs are being redesigned for low-latency agentic workloads.
In practice
- Run autonomous AI agents directly on RTX Spark PCs with 128 GB memory.
- Deploy large language models (hundreds of billions of parameters) on edge devices.
- Utilize GPU-accelerated tools on Azure for faster agent iteration and token generation.
Topics
- Edge AI
- Personal AI
- Agentic Systems
- Cloud AI Infrastructure
- NVIDIA-Microsoft Partnership
- GPU Acceleration
Best for: Investor, Executive, AI Architect, CTO, VP of Engineering/Data, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by NVIDIA.