LLMs help robots understand vague instructions and focus on key details

· Source: MIT News - Artificial intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Advanced, medium

Summary

MIT CSAIL has developed "Masked Inverse Reinforcement Learning" (Masked IRL), a novel approach published on June 26, 2026, that uses two large language models to enhance robot understanding of vague instructions and improve focus on critical details. The first LLM clarifies ambiguous user prompts, such as expanding "stay close" to "stay close to the surface of the table," based on kinesthetic demonstration data. A second LLM then evaluates environmental details, masking irrelevant elements (scoring them "0") and incorporating crucial ones ("1") into the robot's motion plan. This method significantly reduces the need for extensive training, requiring nearly five times less demonstration data. Masked IRL demonstrated up to a 15 percent improvement over comparable baselines in correctly identifying implicit user preferences, enabling robots to safely maneuver objects around obstacles in both simulated and real-world tasks, even with previously unseen prompts.

Key takeaway

For Robotics Engineers grappling with ambiguous user instructions or extensive data requirements, you should consider integrating dual LLM architectures like Masked IRL. This approach allows your autonomous systems to interpret vague commands and filter environmental noise, significantly reducing the need for extensive kinesthetic demonstrations. Your robots can learn complex tasks more efficiently and safely maneuver around unstated obstacles, improving performance by up to 15 percent in identifying implicit user preferences.

Key insights

Masked IRL uses dual LLMs to clarify vague robot instructions and filter environmental noise, reducing demonstration data needs.

Principles

Method

Masked IRL uses one LLM to elaborate on kinesthetic demonstration trajectories and vague prompts, then a second LLM to mask irrelevant environmental details, scoring them 1 (important) or 0 (irrelevant) for motion planning.

In practice

Topics

Best for: Research Scientist, AI Scientist, Robotics Engineer, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by MIT News - Artificial intelligence.