Self-Evolving Multi-Agent Framework for Efficient Decision Making in Real-Time Strategy Scenarios
Summary
SEMA (Self-Evolving Multi-Agent) is a novel framework designed to enhance decision-making in Real-Time Strategy (RTS) scenarios like StarCraft II, addressing the speed-quality trade-off inherent in Large Language Models (LLMs). The framework employs a collaborative multi-agent system that self-evolves by adaptively calibrating model bias through in-episode assessment and cross-episode analysis. It integrates dynamic observation pruning based on structural entropy to topologically model game states, distilling high-dimensional data into core semantic information and significantly reducing inference time. Additionally, SEMA utilizes a hybrid knowledge-memory mechanism combining micro-trajectories, macro-experience, and hierarchical domain knowledge to improve strategic adaptability and decision consistency. Experiments on multiple StarCraft II maps show SEMA achieves superior win rates and reduces average decision latency by over 50%, demonstrating its efficiency and robustness.
Key takeaway
For research scientists developing autonomous agents in dynamic, high-dimensional environments like RTS games, SEMA offers a blueprint for overcoming LLM latency and inconsistency. You should consider integrating structural entropy-driven observation pruning and a multi-agent collaborative architecture to achieve real-time, high-quality decision-making without extensive fine-tuning. This approach can significantly improve agent robustness and performance in complex adversarial settings.
Key insights
SEMA is a multi-agent LLM framework for RTS that uses structural entropy and hybrid memory to achieve fast, consistent decisions.
Principles
- Adaptive bias calibration improves LLM consistency.
- Topological modeling reduces high-dimensional observation data.
- Hybrid memory enhances strategic adaptability.
Method
SEMA employs a decision-making, evaluation, and policy agent system. It uses structural entropy for dynamic observation pruning, and a hybrid knowledge-memory mechanism for cross-round strategy evolution and real-time action correction.
In practice
- Apply structural entropy for data compression in dynamic environments.
- Implement multi-agent systems for complex task decomposition.
- Utilize hybrid memory for long-term strategic consistency.
Topics
- Real-Time Strategy
- Multi-Agent Systems
- Large Language Models
- Structural Entropy
- StarCraft II
Best for: Research Scientist, AI Researcher, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.MA updates on arXiv.org.