WISE: A Long-Horizon Agent in Minecraft with Why-Which Reasoning
Summary
WISE (Which-Why Informed Semantic Explorer), a long-horizon agent framework for Minecraft, addresses limitations of existing LLM-augmented hierarchical agents, specifically low-level controller bottlenecks due to repeated execution failures and decoupled memory/reasoning. It introduces a Causal Event Graph to augment episodic memory with explicit causal structures, linking observations to task relevance for robust recall under viewpoint changes and opportunistic task reordering. WISE also features an Opportunistic Task Scheduler for dynamic subtask reprioritization and a multi-scale progressive exploration strategy. Experiments demonstrate WISE's effectiveness, showing a 30% increase in sequential sparse task success with 26.4% lower completion time, and a 44% increase in adaptive non-sequential task success with 42.5% less completion time compared to MrSteve. It further achieved 97% map coverage on a 128x128 real-world map, significantly outperforming MrSteve's 67%.
Key takeaway
For AI Scientists developing embodied agents for complex, open-ended environments like Minecraft, traditional hierarchical LLM-augmented agents often bottleneck at low-level control due to rigid memory and planning. You should integrate semantic causal memory and opportunistic scheduling to enable adaptive decision-making and robust recall. Consider multi-scale exploration to ensure comprehensive observations, significantly improving task success and efficiency in sparse, long-horizon scenarios.
Key insights
WISE unifies exploration, memory, and planning in Minecraft agents via causal reasoning for adaptive, efficient long-horizon task execution.
Principles
- Causal reasoning enhances episodic memory beyond visual similarity.
- Dynamic task scheduling improves efficiency by exploiting opportunities.
- Multi-scale exploration ensures comprehensive environmental coverage.
Method
WISE uses a Causal Event Graph for semantic memory, an Opportunistic Task Scheduler for dynamic subtask prioritization, and a multi-scale progressive exploration strategy (quadtree, frontier, Voronoi) for comprehensive observation acquisition.
In practice
- Implement a Causal Event Graph for semantic memory in embodied agents.
- Integrate an opportunistic scheduler for dynamic task re-prioritization.
- Employ multi-scale exploration for efficient environment coverage.
Topics
- Embodied AI Agents
- Minecraft Environment
- Causal Reasoning
- Large Language Models
- Episodic Memory
- Task Scheduling
- Multi-scale Exploration
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Robotics Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.