Blind Dexterous Grasping via Real2Sim2Real Tactile Policy Learning

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Robotics & Autonomous Systems, Artificial Intelligence & Machine Learning · Depth: Expert, extended

Summary

A novel Real2Sim2Real framework addresses challenges in tactile-only blind grasping for multi-fingered robotic hands, specifically the tactile sim-to-real gap and the limited expressiveness of sparse tactile signals. The approach integrates three key components: a Real2Sim tactile calibration pipeline that constructs a contact-calibrated digital-twin simulator, a layout-aware tactile encoder improving sparse observation expressiveness via self-supervised pretraining, and a tactile-conditioned Diffusion Policy aggregating successful grasp trajectories from object-specific reinforcement learning experts. Evaluated on a physical LEAP Hand equipped with 44 distributed tactile channels across 10 seen and 10 unseen objects, the deployed policy achieved a 27% real-world grasp success rate without real-world demonstrations or visual input. Simulation ablations confirmed that layout-aware pretraining enhances performance, and Real2Sim calibration increases tactile contact consistency between simulation and hardware.

Key takeaway

For Robotics Engineers developing dexterous manipulation systems, especially those facing sim-to-real transfer challenges with tactile feedback, this research offers a practical blueprint. You should prioritize implementing contact-event calibration to align simulated and real tactile signals, and integrate geometry-aware tactile representation learning. Adopting a diffusion-based policy can further improve generalization for blind grasping across diverse objects, even with sparse tactile inputs.

Key insights

Calibrated simulation and geometry-aware tactile representations enable blind dexterous grasping on physical robots.

Principles

Method

The framework involves Real2Sim tactile calibration, self-supervised pretraining of a layout-aware tactile encoder, and aggregating object-specific RL expert trajectories into a tactile-conditioned Diffusion Policy.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Robotics Engineer, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.