The Query Channel: Information-Theoretic Limits of Masking-Based Explanations

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Mathematics & Computational Sciences · Depth: Expert, extended

Summary

Masking-based post-hoc explanation methods like KernelSHAP and LIME are fundamentally limited by information-theoretic principles, according to new research. This work models explanation extraction as communication over a "query channel," where the latent explanation is a message and each masked evaluation is a channel use. It introduces concepts like explanation complexity (entropy of the hypothesis class) and query interface information rate (identification capacity). A strong converse theorem proves that if the explanation rate exceeds this capacity, exact recovery probability converges to one in error. Conversely, an achievability result shows a sparse maximum-likelihood decoder can reliably recover explanations when the rate is below capacity. Experiments reveal an "algorithmic gap" where information theory allows reliable explanations, but standard convex surrogates (Lasso, OLS) fail. The analysis also identifies a critical resolution for super-pixels and tokenization, beyond which explanations become information-theoretically unattainable due to channel degradation from noise and nonlinear curvature.

Key takeaway

For AI Scientists and Research Scientists designing or evaluating masking-based XAI, you must consider the information-theoretic limits of your query channel. If your explanation's entropy exceeds the channel's capacity, reliable recovery is mathematically impossible, regardless of computational power. You should carefully balance explanation resolution, sparsity, and query budget to remain within information-theoretically feasible regimes. Persistent instability or error floors often signal that you are operating beyond the channel's capacity, not merely facing algorithmic deficiencies.

Key insights

Masking-based explanations are fundamentally limited by information-theoretic rate-capacity tradeoffs, dictating recoverability.

Principles

Method

Formulate explanation as communication over a noisy "query channel," quantifying explanation complexity via entropy and query information via parameter-identification capacity. Compare explanation rate R=H(M)/T with C^(S).

In practice

Topics

Best for: AI Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.