Exploration Structure in LLM Agents for Multi-File Change Localization
Summary
A study on LLM agents for multi-file change localization reveals that traditional linear repository exploration is inefficient for changes spanning multiple subsystems. Researchers compared this against a non-linear, domain-scoped parallel agentic exploration approach using SWE Bench Pro (ansible) and an expanded benchmark including 2025/2026 PRs. The domain-scoped parallel agent spawning, utilizing a small Haiku-class model, achieved the highest micro F1 among its class, and was second only to the larger Codex 5.5 High on the expanded benchmark. Additional findings indicate documentation evolution as an unresolved latent dependency, naive file system access degrading localization due to test-file overprediction, and forced multi-agent consultation increasing token cost without measurable benefit.
Key takeaway
For AI Engineers designing LLM agents for software issue resolution, prioritize implementing non-linear, domain-scoped parallel exploration. This approach significantly improves multi-file change localization, outperforming linear methods. You should also avoid naive file system access, which can degrade localization, and recognize that forcing multi-agent consultation increases token costs without measurable performance gains.
Key insights
Linear LLM agent exploration is structurally mismatched for multi-subsystem software changes, benefiting from domain-scoped parallel approaches.
Principles
- Linear exploration mismatches multi-subsystem changes.
- Domain-scoped parallel agents improve localization.
- Forced multi-agent consultation raises cost without benefit.
Method
Construct persistent-session evaluation of GitHub issues anchored at a single base commit, comparing linear versus non-linear domain-scoped parallel agentic exploration.
In practice
- Implement domain-scoped parallel agents for complex changes.
- Avoid naive file system access to prevent overprediction.
- Do not force multi-agent consultation for efficiency.
Topics
- LLM Agents
- Software Engineering
- Change Localization
- SWE Bench Pro
- Domain-scoped Agents
- Haiku-class Models
Best for: Machine Learning Engineer, AI Scientist, Research Scientist, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.