RCT: A Robot-Collected Touch-Vision-Language Dataset for Tactile Generalization

· Source: Artificial Intelligence · Field: Technology & Digital — Robotics & Autonomous Systems, Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, quick

Summary

RCT (Robotic Contact Tactile) is a new robot-collected touch-vision-language dataset designed to address tactile generalization challenges for robots manipulating open-world objects. Comprising 29,279 tactile frames, RCT captures full robot presses on 122 industrial reference materials across 7 categories, utilizing three DIGIT sensors at various contact positions. Each press is preserved as a contact sequence, enabling rigorous held-out evaluation across materials, categories, and sensors. The dataset reveals that frame-random splits can introduce near-duplicate observations, and removing contact-sequence overlap significantly reduces tactile-to-text Recall@1 by 17.7 percentage points. Furthermore, holding out materials during training causes a sharp performance drop, with held-out-material Recall@1 averaging 25.1 +/- 6.1%. RCT facilitates reproducible contact-sequence-aware evaluation and underscores novel-material generalization as a critical hurdle for robotic tactile perception.

Key takeaway

For Machine Learning Engineers developing tactile perception models, you must rigorously evaluate generalization to novel materials. Frame-random data splits can inflate performance; instead, ensure your test sets contain entirely held-out contact sequences and materials. This approach, facilitated by datasets like RCT, will reveal true model capabilities and highlight the critical challenge of novel-material generalization, guiding more robust model development for open-world robotic applications.

Key insights

Robot tactile perception struggles with novel material generalization, exacerbated by data collection biases.

Principles

Method

The proposed method involves collecting full robot presses on diverse materials with multiple sensors, preserving contact sequences, and using uniform press sampling for contrastive training.

In practice

Topics

Best for: Research Scientist, AI Scientist, Robotics Engineer, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.