Beyond Partner Diversity: An Influence-Based Team Steering Framework for Zero-Shot Human-Machine Teaming

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, extended

Summary

Influence-Based Team Steering (IBTS) is a novel framework designed to enhance zero-shot human-machine teaming (HMT) by addressing the limitations of diversity-only approaches, particularly in sparse-reward environments and complex, multi-agent coordination scenarios. Developed by Wei Sheng and Rohan Paleja from Purdue University, IBTS integrates influence shaping to foster diverse, high-performing team interaction patterns during agent training and then uses a predictor-guided steering mechanism to guide ongoing interactions toward these learned coordination modes. The framework was evaluated on the Overcooked-AI platform in both two-agent and three-agent settings, including a 30-subject human study involving two human teammates and one machine teammate. Across simulated, synthetic, and real-human evaluations, IBTS consistently improved team performance over baselines like FCP, MEP, and GAMMA, demonstrating its effectiveness in scenarios requiring longer interaction chains and larger teams.

Key takeaway

For research scientists developing scalable human-machine teaming solutions, IBTS offers a robust framework to overcome the limitations of diversity-only approaches in complex, sparse-reward environments. You should consider integrating influence shaping and predictor-guided steering into your multi-agent reinforcement learning pipelines to foster more effective and transferable coordination patterns, especially when scaling to multi-human, multi-AI teams. This approach can significantly improve team performance and adaptability with unseen human partners.

Key insights

IBTS improves human-machine teaming by combining partner diversity with influence-based coordination shaping and trajectory steering.

Principles

Method

IBTS constructs a diverse team pool using influence shaping and behavioral diversity, trains a transformer-based predictor to recognize coordination patterns from trajectory histories, and then steers best-response policies toward higher-quality historical coordination modes.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Robotics Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.