DiG-Plan: Mitigating Early Commitment for Tool-Graph Planning via Diffusion Guidance
Summary
DiG-Plan, a framework for tool-graph planning, addresses the "early commitment" problem in autoregressive (AR) decoding for combinatorial tool selection. A controlled study demonstrated that masked denoising significantly improved Pass@10 solution coverage from 0.320 to 0.943 over AR sampling under matched compute. Motivated by this, DiG-Plan decouples combinatorial exploration from structural refinement, employing a diffusion-based proposer to generate diverse tool sets via iterative refinement, followed by an AR refiner for dependency prediction. The framework also incorporates a judge-free value function for optimal candidate selection. On TaskBench, DiG-Plan achieved a 10% relative improvement over AR baselines, with the largest gains observed on complex compositional tasks, and showed cross-domain effectiveness on API-Bank.
Key takeaway
For AI Scientists or Machine Learning Engineers developing tool-augmented LLMs, recognize that standard autoregressive decoding limits combinatorial tool-set exploration due to early commitment. You should consider adopting a propose-refine-select framework like DiG-Plan. This involves using diffusion models for diverse tool-set generation and autoregressive models for dependency prediction. Implement a judge-free value function to select optimal plans, improving performance by 10% on complex tasks.
Key insights
Autoregressive decoding's "early commitment" limits tool-set exploration; diffusion models offer superior combinatorial search.
Principles
- Autoregressive decoding struggles with combinatorial search due to early commitment.
- Decoupling tool-set exploration from dependency prediction improves planning.
- Diffusion models enhance diversity in combinatorial search spaces.
Method
DiG-Plan uses a diffusion proposer for diverse tool sets, an AR refiner for dependency edges, and a judge-free value function to select the best plan.
In practice
- Use diffusion models for diverse candidate tool-set generation.
- Employ AR models for structured dependency prediction given fixed tool sets.
- Implement a judge-free value function for optimal plan selection.
Topics
- Tool-Augmented LLMs
- Diffusion Models
- Autoregressive Decoding
- Tool-Graph Planning
- Combinatorial Search
- TaskBench
Code references
Best for: Research Scientist, NLP Engineer, AI Scientist, Machine Learning Engineer, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.