How to Build In-Vehicle AI Agents with NVIDIA: From Cloud to Car

· Source: NVIDIA Technical Blog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Cloud Computing & IT Infrastructure · Depth: Intermediate, long

Summary

The automotive industry is transitioning from rule-based in-vehicle assistants to agentic, multimodal AI systems capable of reasoning, planning, and acting. These new systems leverage large language models (LLMs), vision-language models (VLMs), and speech models to provide conversational AI with memory, multimodal interaction, and proactive assistance. Global shipments of vehicles with agentic AI are projected to grow from 5 million in 2025 to 70 million by 2035. NVIDIA addresses the challenge of real-time AI at the edge with its DRIVE AGX platforms, offering solutions like the AI Box for augmenting existing infotainment systems, the DRIVE AGX Thor for multi-domain AI centralization, and integrated cockpit computers with MediaTek Dimensity AX. A hybrid edge-cloud architecture is proposed for comprehensive experiences, supported by an agentic AI pipeline involving ASR, orchestrators, LLM inference engines, AI models, and TTS, with development facilitated by the NVIDIA NeMo platform and TensorRT Edge-LLM for deployment.

Key takeaway

For AI Architects and Machine Learning Engineers designing next-generation automotive systems, you should prioritize NVIDIA DRIVE AGX platforms and the NeMo ecosystem to build agentic, multimodal in-vehicle AI. This approach enables scalable, production-ready solutions that meet strict latency and privacy requirements while supporting a hybrid edge-cloud architecture for comprehensive user experiences.

Key insights

Agentic, multimodal AI systems are transforming automotive cockpits, requiring robust edge-cloud architectures for real-time performance.

Principles

Method

Develop agentic AI via an AI factory workflow: train, fine-tune, evaluate models in the cloud using NeMo, then optimize with TensorRT and deploy on DRIVE AGX platforms using TensorRT Edge-LLM.

In practice

Topics

Code references

Best for: AI Engineer, Machine Learning Engineer, AI Architect

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by NVIDIA Technical Blog.