The Provenance Paradox in Multi-Agent LLM Routing: Delegation Contracts and Attested Identity in LDP

· Source: cs.MA updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Advanced, long

Summary

The paper introduces critical extensions to the LLM Delegate Protocol (LDP) to address the "provenance paradox" in multi-agent LLM systems, where routing based on self-reported quality scores systematically selects the worst delegates. It extends LDP with delegation contracts that define explicit objectives, budgets, and failure policies for delegated tasks. A claimed-vs-attested identity model is added to distinguish self-reported quality from verified metrics (runtime_observed, issuer_attested, externally_benchmarked). Furthermore, typed failure semantics replace unstructured error strings, enabling automated recovery. Controlled experiments with 10 simulated delegates and validation with real Claude models demonstrate that self-claimed routing performs worse than random (simulated: 0.55 vs. 0.68; real models: 8.90 vs. 9.30), while attested routing achieves near-optimal performance (0.95 simulated, 9.80 real). These backward-compatible extensions incur sub-microsecond validation overhead.

Key takeaway

For AI Scientists designing multi-agent LLM systems, relying solely on self-reported quality metrics for task routing is detrimental. You should prioritize implementing protocols that support attested identity, such as the extended LDP, to ensure routing selects capable agents based on verified performance. Incorporating delegation contracts and typed failure semantics will further enhance system reliability and auditability, preventing the "provenance paradox" from degrading overall system performance.

Key insights

Unverified self-claimed quality in multi-agent LLM routing leads to systematically selecting the worst delegates.

Principles

Method

Extend LLM Delegate Protocol (LDP) with delegation contracts for bounded authority, a claimed-vs-attested identity model for verified quality, and typed failure semantics for automated recovery.

In practice

Topics

Code references

Best for: AI Scientist, Research Scientist, CTO, AI Engineer, Machine Learning Engineer, AI Architect

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.MA updates on arXiv.org.