CoorDex: Coordinating Body and Hand Priors for Continuous Dexterous Humanoid Loco-Manipulation

· Source: Takara TLDR - Daily AI Papers · Field: Technology & Digital — Robotics & Autonomous Systems, Artificial Intelligence & Machine Learning · Depth: Expert, medium

Summary

CoorDex introduces a novel learning pipeline for continuous, high degree-of-freedom (DoF) humanoid loco-manipulation. It addresses the common simplification of stop-and-go processes and low-DoF end effectors. This system enables a Unitree G1 humanoid, equipped with a 20-DoF WUJI hand, to perform dexterous tasks while in motion. Examples include non-stop bottle grasping and carrying, fridge door opening, and cube pick-and-turn. The pipeline begins with simulated whole-body and hand demonstrations, training privileged motion tracking teachers. These teachers are then distilled into proprioception-conditioned latent priors. These priors serve as the action space for downstream residual reinforcement learning. A coordinated latent residual policy integrates these priors via shared task context and separate body-hand residual heads. This maintains natural whole-body movement and improves finger-level contact reliability. Ablation studies confirm that this latent-prior interface and coordinated residual structure are crucial for training high-dimensional, contact-rich loco-manipulation. Traditional joint-space PPO or monolithic latent prediction methods fail under the same reward budget.

Key takeaway

For Robotics Engineers developing humanoid systems, CoorDex offers a robust method to overcome the limitations of traditional stop-and-go manipulation. You should consider adopting its latent residual control pipeline to integrate high-DoF dexterous hand control with whole-body locomotion. This approach enables more fluid, continuous, and complex humanoid tasks, improving contact reliability and overall system performance in dynamic environments.

Key insights

CoorDex enables continuous, high-DoF humanoid loco-manipulation by coordinating body and dexterous hand control through latent residual learning.

Principles

Method

CoorDex trains privileged motion tracking teachers, distills them into proprioception-conditioned latent priors, and uses these frozen priors as the action space for downstream residual reinforcement learning.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Robotics Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.