Multi-View In-Cabin Monitoring System for Public Transport Vehicles

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, quick

Summary

A new multi-view in-cabin monitoring dataset has been introduced for public transportation, specifically designed for a digitalized and partly automated German city bus. This dataset comprises 9,136 synchronized samples, featuring RGB and depth images from four inward-facing cameras, alongside data from a rotating LiDAR covering the vehicle's interior. It includes comprehensive annotations and is supported by a calibration and pseudo-labeling pipeline that generates 3D human pose estimates and oriented 3D bounding boxes for occupants. The release also offers a nuScenes-format conversion and benchmarks for representative multi-view 3D detection models, such as Lift-Splat-Shoot and BEVFusion. This resource facilitates comparative evaluation and small-scale training of multi-view in-cabin perception models, with all data and tools publicly available on GitHub.

Key takeaway

For computer vision engineers developing in-cabin monitoring systems for public transport or autonomous vehicles, this new dataset offers a critical resource. You should explore integrating this multi-modal data, including RGB, depth, and LiDAR, to enhance the robustness of your 3D detection and pose estimation models. Utilizing the provided nuScenes-format conversion and benchmarks can accelerate your model evaluation and small-scale training efforts, potentially improving occupant safety and service quality.

Key insights

A new multi-modal dataset and tools enable advanced in-cabin monitoring for public transport, supporting 3D perception model development.

Method

A calibration and pseudo-labeling pipeline generates 3D human pose estimates and oriented 3D bounding boxes from synchronized multi-view RGB-D and LiDAR data.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Computer Vision Engineer, Robotics Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.