DAIMON Robotics Wants to Give Robot Hands a Sense of Touch

· Source: IEEE Spectrum · Field: Technology & Digital — Robotics & Autonomous Systems, Artificial Intelligence & Machine Learning · Depth: Intermediate, long

Summary

DAIMON Robotics, a two-and-a-half-year-old company, released Daimon-Infinity in April, described as the largest omni-modal robotic dataset for physical AI. This dataset features high-resolution tactile sensing and covers tasks from laundry folding to factory assembly, supported by partners including Google DeepMind and Northwestern University. DAIMON, known for its monochromatic vision-based tactile sensor packing over 110,000 effective sensing units, leverages this technology and a distributed out-of-lab collection network to generate millions of hours of data annually. The company open-sourced 10,000 hours of this data to accelerate embodied AI deployment. Prof. Michael Yu Wang, DAIMON's chief scientist, pioneered the Vision-Tactile-Language-Action (VTLA) architecture, elevating tactile feedback to a modality on par with vision, addressing the "insensitivity" of current Vision-Language-Action (VLA) models.

Key takeaway

For Robotics Engineers or AI Scientists developing advanced manipulation systems, integrating high-resolution tactile sensing is critical. The Vision-Tactile-Language-Action (VTLA) model, supported by datasets like Daimon-Infinity, addresses limitations of Vision-Language-Action (VLA) by enabling precise force control, slip detection, and object localization in challenging environments. You should consider adopting vision-based tactile sensors to enhance robot dexterity and reliability, especially for tasks involving fragile items or confined spaces, accelerating real-world embodied AI deployment.

Key insights

Tactile sensing is crucial for advanced robot manipulation, enabling the VTLA model to overcome VLA limitations.

Principles

Method

DAIMON's method involves high-resolution vision-based tactile sensors, a robust multimodal data processing pipeline, and a distributed out-of-lab collection network for large-scale data generation.

In practice

Topics

Best for: AI Engineer, Machine Learning Engineer, Research Scientist, Robotics Engineer, AI Scientist, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by IEEE Spectrum.