When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges

· Source: cs.SE updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Natural Language Processing · Depth: Expert, extended

Summary

A study on multi-objective prompt optimization for LLM judges reveals significant failure modes when extending single-objective textual gradient methods to multi-criteria evaluation. Researchers tested five decomposition modes (Single-Task, SSS, SSC, SCC, CCC) on SummEval, evaluating four criteria. Results showed that in 6 of 10 configurations, optimization never improved over the initial prompt. A key finding was a 59% drop in gradient specificity (from 9.0 to 3.7) when the gradient LLM processed multiple criteria jointly. Furthermore, combining individually optimal per-task instructions degraded Spearman's ρ by -0.053, indicating inference-time issues. These findings identify two distinct bottlenecks: optimization-time "gradient dilution" and inference-time "instruction interference," which constrain effective multi-objective judge design.

Key takeaway

For AI Scientists customizing LLM judges for multi-criteria tasks, be aware that current textual gradient optimization methods are unreliable. Your efforts may be hampered by "gradient dilution" during optimization and "instruction interference" at inference time, leading to performance degradation. You should prioritize architectural changes, such as implementing per-task decomposition for gradient generation or exploring conflict-aware gradient resolution, to achieve reliable multi-objective prompt improvements.

Key insights

Multi-objective prompt optimization for LLM judges fails due to gradient dilution and instruction interference.

Principles

Method

The study implemented a 4-stage TextGrad pipeline (task model, loss LLM, gradient LLM, optimizer LLM) and tested five decomposition modes on SummEval.

In practice

Topics

Best for: AI Scientist, Research Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.SE updates on arXiv.org.