Build Your Own AI Assistant with Hugging Face on NVIDIA DGX Spark

· Source: NVIDIA · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Cloud Computing & IT Infrastructure · Depth: Intermediate, quick

Summary

This content describes building a personal AI assistant named Richi, leveraging an NVIDIA DGX Spark for local model execution and privacy. The setup utilizes Brev to manage the DGX Spark as a personal cloud, integrating both a Frontier model API and a locally running open model for email privacy. An intent-based model router directs email-related prompts to the local model while other tasks use the Frontier model. The assistant interacts with the physical world via a Hugging Face Reachi Mini Robot, controlling its head, ears, and camera through tool calls. Voice capabilities are added using the 11 Labs API. The assistant demonstrates tasks like managing to-do lists, sending emails, converting sketches to architectural renderings, creating room tour videos, and even monitoring pets, with the ability to share access to the Spark and Reachi.

Key takeaway

For AI Engineers building custom assistants, consider a hybrid model architecture that combines local, privacy-focused models on hardware like NVIDIA DGX Spark with cloud-based Frontier models. This approach, coupled with intent-based routing, allows you to maintain data privacy for sensitive tasks while leveraging powerful external APIs for general capabilities. Explore integrating physical robots and voice APIs to extend your assistant's utility into the real world.

Key insights

A personal AI assistant can integrate local and cloud models for privacy and physical interaction.

Principles

Method

Set up a personal cloud on DGX Spark, integrate Frontier and local models with an intent router, add physical interaction via a robot, and incorporate voice synthesis.

In practice

Topics

Best for: AI Engineer, Machine Learning Engineer, AI Architect

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by NVIDIA.