HANDOFF: Humanoid Agentic Task-Space Whole-Body Control via Distilled Complementary Teachers
Summary
HANDOFF, a novel humanoid whole-body controller, addresses the critical challenge of command space for real-world robot deployment by introducing a compact, explicit interface. This interface is designed to be intuitive, general, modular, and expressive, overcoming the difficulty planners face in synthesizing dense kinematic references from task semantics. HANDOFF is developed through multi-teacher KL distillation, employing a context-conditioned gating scheme to create a mixture-of-experts student from three specialized teachers: whole-body motion tracking (using safety-filtered data), locomotion, and fall-recovery. Demonstrated on the Unitree G1 robot, HANDOFF achieves state-of-the-art velocity tracking and provides one of the largest robust manipulation workspaces. Its hardware feasibility is further validated through natural-language-driven task roll-outs, powered by a VLM-driven agentic planner, requiring no task-specific data or controller fine-tuning. The system was published on 2026-06-04.
Key takeaway
For robotics engineers developing humanoid control systems, HANDOFF offers a robust solution to simplify task planning and expand manipulation capabilities. You should consider adopting its compact, explicit command interface to streamline integration with agentic planners. This approach enables natural-language-driven task execution without extensive fine-tuning, potentially accelerating real-world deployment of advanced humanoid robots like the Unitree G1.
Key insights
HANDOFF uses distilled multi-teacher control for robust, intuitive humanoid whole-body command, enabling diverse manipulation and locomotion.
Principles
- Compact interfaces simplify robot task planning.
- Multi-teacher distillation enhances controller robustness.
- Context-conditioned gating improves expert integration.
Method
HANDOFF distills three specialist teachers (motion tracking, locomotion, fall-recovery) into a mixture-of-experts student via multi-teacher KL distillation with context-conditioned gating.
In practice
- Deploy humanoid robots with intuitive command interfaces.
- Achieve state-of-the-art velocity tracking on Unitree G1.
- Enable natural-language-driven task execution.
Topics
- Humanoid Robotics
- Whole-Body Control
- Multi-Teacher Distillation
- Agentic Planning
- Unitree G1
- Task-Space Control
Best for: Research Scientist, Robotics Engineer, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.