Impact of Hand Impairment and Occlusions on Hand Pose Estimation Accuracy in Augmented Reality Applications

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Emerging Technologies & Innovation · Depth: Expert, quick

Summary

A study assessed hand pose estimation accuracy in Augmented Reality (AR) applications, particularly for hand rehabilitation, comparing the HoloLens 2 HMD with established pose estimation algorithms like WiLoR, HaMeR, WildHands, and MediaPipe. The research involved 13 individuals with cervical spinal cord injury (cSCI) and 15 uninjured controls interacting with clear and opaque objects. Ground truth 3D joint positions were established using a multi-camera setup. Findings indicate that accuracy did not differ between cSCI and uninjured groups, suggesting generalizability to hand-impaired populations. Clear objects offered a minor 0.1 mm accuracy advantage over opaque objects. Furthermore, WiLoR and HaMeR algorithms demonstrated a slight 2 mm accuracy improvement over the HoloLens 2. These results suggest the HoloLens 2's potential viability for hand rehabilitation applications and highlight the generated dataset's utility for refining pose estimation methods.

Key takeaway

For Computer Vision Engineers developing AR hand rehabilitation applications, this research indicates that HoloLens 2 offers viable hand pose estimation, even for impaired users. You should consider its direct integration for practical deployments, but be aware that algorithms like WiLoR and HaMeR provide a 2 mm accuracy edge. When designing interactions, prioritize clear objects, as they yield a slight 0.1 mm accuracy advantage, optimizing user experience and data quality.

Key insights

Hand pose estimation in AR generalizes to impaired hands, with minor accuracy impacts from object opacity and HMDs versus algorithms.

Principles

Method

The study assessed HoloLens 2 and four algorithms against multi-camera ground truth, using cSCI and control groups interacting with clear/opaque objects.

In practice

Topics

Best for: AI Scientist, Computer Vision Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.