Mesh-Aware Epipolar Matching for Multi-View Multi-Person 3D Pose Estimation in Basketball

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computer Vision, Robotics & Autonomous Systems · Depth: Expert, quick

Summary

Mesh-Aware Epipolar Matching (MAEM) is a novel, training-free framework designed for multi-view multi-person 3D pose estimation in challenging team sports like basketball. Addressing issues such as player occlusions, appearance similarities from uniforms, and limited annotated multi-view data, MAEM leverages a monocular 3D human mesh recovery model as its initial stage. It then implements a two-stage epipolar matching strategy, utilizing the recovered mesh outputs. This framework integrates disjoint-set-union-based clustering with per-joint triangulation to ensure robust cross-view association and precise 3D pose reconstruction. Experiments on public basketball datasets, SportCenter EPFL and Human-M3 Basketball, show MAEM consistently outperforms existing training-free association baselines. It achieves MPJPE/PA-MPJPE scores of 59.8/40.7 mm and 74.0/51.8 mm respectively, demonstrating the effectiveness of dense mesh geometry for association without requiring target-domain training or fine-tuning.

Key takeaway

For Computer Vision Engineers developing multi-person 3D pose estimation systems in challenging environments like sports, consider integrating training-free approaches. Your team can achieve robust cross-view association and accurate 3D pose reconstruction by leveraging dense mesh geometry from monocular recovery models. This method, exemplified by MAEM's 59.8/40.7 mm MPJPE scores, avoids extensive data annotation and domain-specific training, streamlining development and deployment.

Key insights

Dense mesh geometry significantly enhances training-free multi-view 3D pose estimation in complex sports environments.

Principles

Method

MAEM uses a monocular 3D human mesh recovery frontend, followed by a two-stage epipolar matching strategy combining disjoint-set-union clustering and per-joint triangulation for 3D pose.

In practice

Topics

Best for: Research Scientist, AI Scientist, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.