Dell Technologies World 2026 Keynote | May 18–21 | Las Vegas

· Source: NVIDIA · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Cloud Computing & IT Infrastructure · Depth: Advanced, medium

Summary

Nvidia CEO Jensen Huang, speaking at Dell Technologies World 2026, highlighted the advent of "useful AI" driven by agentic systems. These systems understand, reason, plan, use tools, and iterate, leading to a 100X to 1,000X increase in computational demand for tasks that can run for a week, yet deliver significant productivity gains. Huang detailed Nvidia's architecture for agents, which integrates large language models on powerful systems like the NVLink72, a secure NVIDIA Open Shell sandbox, and a NemoCloud reference harness running on a CPU. The architecture leverages Nvidia's Vera CPU, optimized for token generation with industry-leading single-threaded performance and three times the memory bandwidth. This comprehensive platform supports all frontier and open-source AI models, operates across cloud and local environments, and incorporates confidential computing. A demonstration featured a desk-side computer capable of running a one trillion-parameter AI model, a feat previously considered impossible.

Key takeaway

For AI Architects evaluating infrastructure for next-generation AI, recognize that agentic systems demand a 100X to 1,000X increase in computational resources. Your strategy should prioritize hybrid AI architectures that support both cloud and local deployment, leveraging specialized CPUs like Nvidia's Vera for efficient token generation. Consider confidential computing for sensitive models and plan for orchestrating multiple agents to maximize productivity gains in areas like software development and DevOps.

Key insights

Agentic AI drives unprecedented productivity and computational demand, requiring specialized, hybrid architectures.

Principles

Method

Nvidia's architecture for agents involves a secure sandbox (NVIDIA Open Shell), a CPU-based harness (NemoCloud), and large language models running on high-performance systems or in the cloud.

In practice

Topics

Best for: CTO, VP of Engineering/Data, MLOps Engineer, AI Architect, Director of AI/ML, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by NVIDIA.