AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

· Source: cs.SE updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Expert, extended

Summary

AutoPass is a multi-agent framework designed for LLVM compiler performance tuning, leveraging Large Language Models (LLMs) to guide optimization decisions. Unlike traditional black-box autotuning, AutoPass integrates LLMs directly with compiler-internal optimization states and intermediate representation analysis. Operating in an inference-only, training-free setting, it requires no offline training or task-specific fine-tuning, making it highly adaptable. Implemented on the LLVM compiler, AutoPass was evaluated on server-grade x86-64 and embedded ARM64 systems. It consistently outperformed expert-tuned heuristics and classical autotuning methods, achieving geometric-mean speedups of 1.043x on x86-64 and 1.117x on ARM64 over the LLVM -O3 baseline. The framework comprises a Score Agent for hotspot identification, an Analysis Agent for feature extraction, a Reasoning Agent for core optimization, and an Evaluation Agent for performance feedback, demonstrating architecture-aware optimization and robust iterative refinement.

Key takeaway

For Machine Learning Engineers or Compiler Engineers optimizing code performance, particularly on diverse architectures like x86-64 and ARM64, you should explore multi-agent LLM frameworks for compiler tuning. AutoPass demonstrates that integrating LLMs with compiler-internal feedback and runtime measurements can yield superior speedups over -O3, such as 1.043x on x86-64 and 1.117x on ARM64, without requiring offline training. Implement iterative refinement and a rollback policy to ensure robust, architecture-aware optimization and mitigate initial performance regressions.

Key insights

LLMs effectively guide compiler optimization by integrating compiler-internal signals and runtime feedback without offline training.

Principles

Method

A multi-agent LLM framework iteratively refines compiler pass sequences and parameters, guided by compiler remarks, IR analysis, and measured runtime performance feedback.

In practice

Topics

Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, Software Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.SE updates on arXiv.org.