WISE: A Long-Horizon Agent in Minecraft with Why-Which Reasoning

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, extended

Summary

WISE (Which-Why Informed Semantic Explorer), a long-horizon agent framework for Minecraft, addresses limitations of existing LLM-augmented hierarchical agents, specifically low-level controller bottlenecks due to repeated execution failures and decoupled memory/reasoning. It introduces a Causal Event Graph to augment episodic memory with explicit causal structures, linking observations to task relevance for robust recall under viewpoint changes and opportunistic task reordering. WISE also features an Opportunistic Task Scheduler for dynamic subtask reprioritization and a multi-scale progressive exploration strategy. Experiments demonstrate WISE's effectiveness, showing a 30% increase in sequential sparse task success with 26.4% lower completion time, and a 44% increase in adaptive non-sequential task success with 42.5% less completion time compared to MrSteve. It further achieved 97% map coverage on a 128x128 real-world map, significantly outperforming MrSteve's 67%.

Key takeaway

For AI Scientists developing embodied agents for complex, open-ended environments like Minecraft, traditional hierarchical LLM-augmented agents often bottleneck at low-level control due to rigid memory and planning. You should integrate semantic causal memory and opportunistic scheduling to enable adaptive decision-making and robust recall. Consider multi-scale exploration to ensure comprehensive observations, significantly improving task success and efficiency in sparse, long-horizon scenarios.

Key insights

WISE unifies exploration, memory, and planning in Minecraft agents via causal reasoning for adaptive, efficient long-horizon task execution.

Principles

Method

WISE uses a Causal Event Graph for semantic memory, an Opportunistic Task Scheduler for dynamic subtask prioritization, and a multi-scale progressive exploration strategy (quadtree, frontier, Voronoi) for comprehensive observation acquisition.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Robotics Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.