Toward Instructions-as-Code: Understanding the Impact of Instruction Files on Agentic Pull Requests
Summary
A study analyzing 15,549 agentic pull requests (PRs) from 148 projects in the AIDev dataset investigated the impact of developer-created instruction files on AI-agent performance. The research found that while instruction files guide agents, they do not guarantee improved efficiency. Specifically, 27.7% of projects saw a merge rate increase of at least 20%, but 26.35% experienced a decrease. Projects with increased merge rates had significantly longer instruction files (median 976 words vs. 569 words) and more structured content, particularly with H3 headers. The study also observed mixed effects on task complexity and merge effort, with some projects showing increased complexity and others increased time/discussion for merging.
Key takeaway
For software engineers integrating AI agents like Copilot into development workflows, simply providing instruction files is insufficient. You should invest in creating comprehensive, well-structured instructions, similar to developing code. Projects with higher merge rates for agentic PRs utilized significantly longer files (median 976 words) with more detailed H3 sections. Treat instruction file creation as a critical software engineering task to maximize agent efficiency and reduce merge friction.
Key insights
Effective AI-agent instruction files are substantially longer and more deeply structured, not just present.
Principles
- Instruction files alone do not guarantee AI-agent performance improvement.
- Longer, well-structured instruction files correlate with higher agentic PR merge rates.
- Treating instruction file development as a software engineering activity is crucial.
Method
The study compared agentic PR metrics (merge rate, complexity, effort) before and after instruction file creation across 148 projects, using Mann-Whitney U tests and Cliff's delta for statistical significance.
In practice
- Prioritize detailed, multi-section instruction files for AI agents.
- Monitor agentic PR metrics after implementing instruction files.
- Structure instructions with H1, H2, and especially H3 headers for clarity.
Topics
- Agentic AI
- Pull Requests
- Instruction Files
- Software Engineering
- AI Agents
- Code Generation
Code references
Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, Software Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.SE updates on arXiv.org.