Enable Real-Time AI for High-Speed Data Acquisition with DAQIRI

2026-06-22 · Source: NVIDIA Technical Blog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Robotics & Autonomous Systems · Depth: Advanced, medium

Summary

NVIDIA DAQIRI (Data Acquisition for Integrated Real-time Instruments) is a high-performance networking library, part of the NVIDIA Holoscan Platform, designed to enable real-time AI for high-speed data acquisition. It shifts from inflexible hardware-centric designs to an adaptable, software-centric architecture, directly connecting high-bandwidth streaming detectors and sensors to the NVIDIA software ecosystem. DAQIRI allows developers to create workflows for in-stream processing tasks such as filtering, inference, compression, and adaptive control without modifying instruments. This is critical for facilities like the Linac Coherent Light Source II (LCLS-II), generating 1 MHz photon pulses, and the High-Luminosity Large Hadron Collider (HL-LHC) at CERN, which will increase luminosity by a factor of 10. Under the hood, DAQIRI bypasses the Linux kernel using DPDK for zero-copy access, routing data directly from NIC to GPU DMA buffers at 100s of Gbps, ensuring low latency and high throughput. It provides C++ and Python APIs with YAML-driven configuration for ease of deployment, abstracting complex low-level networking.

Key takeaway

For AI Engineers and Research Scientists building real-time data acquisition systems, NVIDIA DAQIRI offers a critical shift from hardware-centric to software-defined processing. You should consider integrating DAQIRI to bypass traditional "store first" bottlenecks, enabling direct streaming of high-bandwidth sensor data to GPUs for immediate AI inference and adaptive control. This allows you to gain real-time insights and prepare high-quality, AI-ready data for downstream supercomputing analysis, significantly accelerating discovery.

Key insights

DAQIRI enables real-time AI processing of high-speed sensor data by directly streaming to GPUs, bypassing traditional bottlenecks.

Principles

Data acquisition bottlenecks shift from missing data to "collect, store, analyze" architectures.
Pre-processing data at the source mitigates information loss from data deluge.
Kernel-bypass networking reduces latency and CPU overhead for high-speed data.

Method

DAQIRI uses DPDK for kernel-bypass, zero-copy data transfer from NIC to GPU DMA buffers, configured via YAML, and processed with C++/Python APIs.

In practice

Stream instrument outputs directly into edge supercomputing systems.
Configure data paths, NICs, GPU memory regions, and flow rules via YAML.
Use C++ or Python APIs to receive GPU-ready tensors for inference.

Topics

Real-Time AI
Data Acquisition
NVIDIA DAQIRI
GPU Acceleration
High-Bandwidth Networking
Edge Computing
DPDK

Code references

NVIDIA/daqiri

Best for: AI Architect, MLOps Engineer, Machine Learning Engineer, AI Engineer, Research Scientist, Software Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by NVIDIA Technical Blog.