Why Solve It Twice? Hierarchical Accumulation of Skills for Transfer-Efficient ML Engineering

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Multiagent Systems · Depth: Expert, quick

Summary

HASTE, a hierarchical multi-agent system, addresses the inefficiency of ML engineering agents repeatedly rediscovering known techniques. It organizes cross-competition knowledge into global, domain, and competition-specific tiers, each managed by a corresponding agent level. An orchestrator uses LLM-driven abstraction to coordinate domain specialists and facilitate learning across these tiers. In a controlled ablation study, HASTE's tiered loading, using a 159-skill inventory across 8 competitions, achieved a 100% medal rate, significantly outperforming flat loading's 62.5% medal rate and consuming 2x fewer output tokens. On the MLE-Bench Lite benchmark (22 Kaggle competitions), HASTE achieved a 77.3% medal rate with Claude Sonnet 4.6, spending 12 hours per competition. Warm-start runs, leveraging previously learned skills, reduced refinement iterations by 52% and increased the acceptance rate of proposed changes from 42% to 85% with 50+ skills.

Key takeaway

For ML engineering teams designing autonomous agents, HASTE's findings indicate that implementing hierarchical skill accumulation is critical. You should prioritize systems that organize knowledge into global, domain, and competition-specific tiers to significantly boost transfer efficiency and reduce compute waste. This approach allows your agents to achieve higher success rates, like 100% medal rates, and drastically cut refinement iterations by 52% in warm-start scenarios, improving overall productivity.

Key insights

Hierarchical knowledge organization in multi-agent ML engineering systems significantly improves transfer efficiency and performance.

Principles

Method

HASTE employs a hierarchical multi-agent system with global, domain, and competition-specific tiers. An orchestrator coordinates agents and promotes learning between tiers via LLM-driven abstraction for efficient skill transfer.

In practice

Topics

Best for: Research Scientist, AI Engineer, AI Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.