NERVE: A Neuromorphic Vision and Radar Ensemble for Multi-Sensor Fusion Research
Summary
NERVE (Neuromorphic Vision and Radar Ensemble) is a new multi-sensor dataset designed for neuromorphic perception research, comprising 257 minutes of synchronized recordings from five sensors: two Dynamic Vision Sensors (DVS), an RGB-D camera, and two Radar units (24GHz and 77GHz). Captured over 12 days in office environments, the dataset totals 600 GB of uncompressed, temporally aligned data, featuring approximately 914,000 frames and 9.6 million COCO-formatted annotations across 16 object categories. The dataset includes an open-source software toolkit for preprocessing and format conversion. Baseline experiments on a DVS+Radar subset for human detection and distance estimation show that combining DVS with 77GHz Radar consistently improves detection, with recurrent models achieving up to 47.5% mAP and mean absolute Radar distance errors below 1.8 m against LiDAR ground truth.
Key takeaway
For research scientists developing perception systems for smart spaces, NERVE offers a unique benchmark for DVS-Radar fusion. You should explore recurrent neural network architectures, such as RVT-Tiny, to fully exploit the temporal dynamics of event-based and radar data, as these models significantly outperform feed-forward approaches. Consider the trade-off between high-resolution DVS for detection accuracy and lower-resolution options for potential power savings in IoT deployments.
Key insights
NERVE is a multi-sensor dataset bridging neuromorphic vision and radar for robust perception research.
Principles
- Multi-sensor fusion overcomes individual sensor limitations.
- Temporal context is critical for motion-dependent sensing.
- Higher spatial resolution improves detection performance.
Method
The dataset uses an automatic labeling pipeline with YOLOv8x on RGB frames, then maps annotations to DVS coordinates via extrinsic calibration and depth-based 3D reprojection, ensuring dense, cross-sensor labels.
In practice
- Combine DVS with 77GHz Radar for improved human detection.
- Utilize recurrent models for superior event-based object detection.
- Explore high-resolution DVS for enhanced spatial detection.
Topics
- NERVE Dataset
- Neuromorphic Vision
- Multi-Sensor Fusion
- Dynamic Vision Sensors
- mmWave Radar
Code references
Best for: Research Scientist, Computer Vision Engineer, AI Scientist, Machine Learning Engineer, Robotics Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.