Blind Dexterous Grasping via Real2Sim2Real Tactile Policy Learning
Summary
A novel Real2Sim2Real framework addresses challenges in tactile-only blind grasping for multi-fingered robotic hands, specifically the tactile sim-to-real gap and the limited expressiveness of sparse tactile signals. The approach integrates three key components: a Real2Sim tactile calibration pipeline that constructs a contact-calibrated digital-twin simulator, a layout-aware tactile encoder improving sparse observation expressiveness via self-supervised pretraining, and a tactile-conditioned Diffusion Policy aggregating successful grasp trajectories from object-specific reinforcement learning experts. Evaluated on a physical LEAP Hand equipped with 44 distributed tactile channels across 10 seen and 10 unseen objects, the deployed policy achieved a 27% real-world grasp success rate without real-world demonstrations or visual input. Simulation ablations confirmed that layout-aware pretraining enhances performance, and Real2Sim calibration increases tactile contact consistency between simulation and hardware.
Key takeaway
For Robotics Engineers developing dexterous manipulation systems, especially those facing sim-to-real transfer challenges with tactile feedback, this research offers a practical blueprint. You should prioritize implementing contact-event calibration to align simulated and real tactile signals, and integrate geometry-aware tactile representation learning. Adopting a diffusion-based policy can further improve generalization for blind grasping across diverse objects, even with sparse tactile inputs.
Key insights
Calibrated simulation and geometry-aware tactile representations enable blind dexterous grasping on physical robots.
Principles
- Contact-event calibration reduces sim-to-real discrepancy.
- Spatial layout priors enhance sparse tactile observations.
- Diffusion policies aggregate diverse grasping behaviors.
Method
The framework involves Real2Sim tactile calibration, self-supervised pretraining of a layout-aware tactile encoder, and aggregating object-specific RL expert trajectories into a tactile-conditioned Diffusion Policy.
In practice
- Use binary tactile signals for robust sim-to-real transfer.
- Pretrain tactile encoders with privileged geometric supervision.
- Employ diffusion policies for multimodal action generation.
Topics
- Blind Grasping
- Dexterous Manipulation
- Tactile Sim-to-Real
- Diffusion Policy
- Tactile Representation Learning
- LEAP Hand
Code references
Best for: Research Scientist, AI Scientist, Robotics Engineer, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.