Mango: Multi-Agent Web Navigation via Global-View Optimization
Summary
Mango is a multi-agent web navigation method designed to improve efficiency on complex websites by optimizing starting points. Unlike traditional agents that begin from the root URL, Mango utilizes a global view of the website's structure to dynamically select optimal initial navigation points. It frames URL selection as a multi-armed bandit problem, employing Thompson Sampling to adaptively distribute the navigation budget across potential URLs. Additionally, Mango incorporates an episodic memory to store and learn from past navigation attempts. Evaluated on WebVoyager, Mango achieved a 63.6% success rate with GPT-5-mini, surpassing the best baseline by 7.3%. On WebWalkerQA, it reached a 52.5% success rate, outperforming the best baseline by 26.8%. The method demonstrates generalizability across various open-source and closed-source models.
Key takeaway
For research scientists developing web agents, understanding Mango's approach to global-view optimization and adaptive URL selection is crucial. Your current root-URL-centric exploration methods may be inefficient on complex sites. Consider integrating multi-armed bandit strategies and episodic memory into your agent designs to significantly improve navigation success rates and resource utilization, as demonstrated by Mango's 26.8% improvement on WebWalkerQA.
Key insights
Optimizing web agent navigation through global-view URL selection and adaptive budget allocation significantly boosts success rates.
Principles
- Global website view enhances navigation.
- Adaptive budget allocation improves efficiency.
- Episodic memory aids learning from attempts.
Method
Mango formulates URL selection as a multi-armed bandit problem, using Thompson Sampling for budget allocation and episodic memory for learning from navigation history.
In practice
- Use Thompson Sampling for URL selection.
- Implement episodic memory for agent learning.
- Leverage website structure for starting points.
Topics
- Mango
- Multi-Agent Web Navigation
- Global-View Optimization
- Multi-Armed Bandit
- Thompson Sampling
Code references
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.