Exploration Structure in LLM Agents for Multi-File Change Localization

· Source: Takara TLDR - Daily AI Papers · Field: Technology & Digital — Software Development & Engineering, Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

A study on LLM agents for multi-file change localization reveals that traditional linear repository exploration is inefficient for changes spanning multiple subsystems. Researchers compared this against a non-linear, domain-scoped parallel agentic exploration approach using SWE Bench Pro (ansible) and an expanded benchmark including 2025/2026 PRs. The domain-scoped parallel agent spawning, utilizing a small Haiku-class model, achieved the highest micro F1 among its class, and was second only to the larger Codex 5.5 High on the expanded benchmark. Additional findings indicate documentation evolution as an unresolved latent dependency, naive file system access degrading localization due to test-file overprediction, and forced multi-agent consultation increasing token cost without measurable benefit.

Key takeaway

For AI Engineers designing LLM agents for software issue resolution, prioritize implementing non-linear, domain-scoped parallel exploration. This approach significantly improves multi-file change localization, outperforming linear methods. You should also avoid naive file system access, which can degrade localization, and recognize that forcing multi-agent consultation increases token costs without measurable performance gains.

Key insights

Linear LLM agent exploration is structurally mismatched for multi-subsystem software changes, benefiting from domain-scoped parallel approaches.

Principles

Method

Construct persistent-session evaluation of GitHub issues anchored at a single base commit, comparing linear versus non-linear domain-scoped parallel agentic exploration.

In practice

Topics

Best for: Machine Learning Engineer, AI Scientist, Research Scientist, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.