ETCH-X: Robustify Expressive Body Fitting to Clothed Humans with Composable Datasets

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, quick

Summary

ETCH-X is an upgraded human body fitting method designed to align parametric body models like SMPL-X to 3D point clouds of clothed individuals, improving upon the original ETCH system. It introduces a tightness-aware fitting paradigm to manage clothing dynamics, effectively "undressing" the input, and replaces sparse markers with implicit dense correspondences for more robust and detailed fitting. This modular design allows for scalable training using diverse datasets such as CLOTH3D for garments, AMASS for full-body motions, and InterHand2.6M for hand gestures. ETCH-X demonstrates significant performance improvements, achieving 33.0% better MPJPE-All on 4D-Dress and 35.8% better V2V-Hands on CAPE for seen data, and 80.8% better MPJPE-All and 80.5% better V2V-All on unseen BEDLAM2.0 data.

Key takeaway

For research scientists developing 3D human reconstruction pipelines, ETCH-X offers a significant advancement in handling clothed subjects and noisy data. You should consider adopting its modular "undress" and "dense fit" stages to improve both the expressiveness and robustness of your body fitting solutions, particularly when working with diverse clothing and complex poses. This approach can lead to substantial performance gains on both seen and unseen datasets.

Key insights

ETCH-X robustly fits expressive body models to clothed 3D scans using a modular, tightness-aware, dense correspondence approach.

Principles

Method

ETCH-X employs a two-stage process: first, "undress" the input using tightness-aware fitting to filter clothing, then perform a "dense fit" using implicit correspondences for fine-grained body alignment.

In practice

Topics

Best for: Research Scientist, AI Scientist, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.