The Query Channel: Information-Theoretic Limits of Masking-Based Explanations

Erciyes Karakaya; Ozgur Ercetin

arxiv: 2604.16689 · v2 · pith:6MH7MKDCnew · submitted 2026-04-17 · 💻 cs.AI

The Query Channel: Information-Theoretic Limits of Masking-Based Explanations

Erciyes Karakaya , Ozgur Ercetin This is my paper

Pith reviewed 2026-05-10 08:06 UTC · model grok-4.3

classification 💻 cs.AI

keywords query channelmasking-based explanationsinformation theoryidentification capacityLIMEKernelSHAPstrong converseblack-box explanations

0 comments

The pith

Masking-based explanations can be recovered reliably only when their rate stays below the query channel's identification capacity.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper recasts the masking procedure used by LIME and KernelSHAP as transmission of a message (the explanation) over a query channel whose uses are randomized perturbations of the black-box input. It measures explanation complexity by the entropy of the hypothesis class and shows that the channel supplies information at a fixed identification capacity per query. A strong converse proves that any explainer whose rate exceeds this capacity must fail to recover the explanation exactly with probability approaching one. An achievability result shows that a sparse maximum-likelihood decoder succeeds below capacity. The framework therefore supplies a fundamental limit on how complex an explanation can be for a given query budget.

Core claim

By treating each masked evaluation as a channel use, the paper proves that reliable recovery of the latent explanation is possible if and only if the rate (entropy per query) lies below the identification capacity of the query channel. Above capacity a strong converse shows that the error probability converges to one for every sequence of explainers and decoders; below capacity a sparse maximum-likelihood decoder attains vanishing error. A Monte Carlo mutual-information estimator supplies a practical benchmark that reveals operating regimes where information theory permits exact recovery yet Lasso- and OLS-based surrogates still fail.

What carries the argument

The query channel, in which the latent explanation is the message and each randomized masking evaluation of the black-box model is one channel use, whose identification capacity sets the maximum rate for reliable recovery.

If this is right

Super-pixel or token resolutions function as source-coding choices that set the entropy of the explanation and must be matched to the query capacity.
Gaussian noise and nonlinear curvature in the black-box reduce the effective capacity and produce waterfall and error-floor behavior.
There exist finite query budgets where information theory guarantees reliable recovery is possible but current convex surrogates still produce large errors.
High-resolution explanations become unattainable once the entropy induced by fine-grained masking exceeds the per-query capacity.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

New explanation algorithms could be designed to approach the capacity bound by replacing convex surrogates with decoders closer to the maximum-likelihood rule.
The same channel model could be applied to non-masking explanation techniques to derive comparable rate limits.
Measuring the effective capacity on large language models would quantify how much query budget is wasted by tokenization choices.

Load-bearing premise

Randomized masking perturbations supply information at a well-defined identification capacity per query that does not depend on the particular black-box model in a way that would invalidate the uniform converse.

What would settle it

A concrete experiment in which an explainer and decoder recover explanations whose entropy exceeds the Monte Carlo estimated capacity with error probability that remains bounded away from one.

Figures

Figures reproduced from arXiv: 2604.16689 by Erciyes Karakaya, Ozgur Ercetin.

**Figure 2.** Figure 2: Unified information-theoretic and algorithmic behavior for the sparse [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗

**Figure 3.** Figure 3: Sample complexity at fixed query budget and varying resolution. Top: conceptual illustration comparing pixel-level SHAP at very high dimension [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗

**Figure 4.** Figure 4: Block error probability for dense (Ridge / OLS proxy) and sparse [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗

**Figure 5.** Figure 5: Empirical relationship between information content [PITH_FULL_IMAGE:figures/full_fig_p012_5.png] view at source ↗

**Figure 6.** Figure 6: Effect of segmentation resolution for texts on prediction performance. Left: Next-token prediction accuracy on an in-context learning (ICL) task as a function of resolution, where resolution denotes the maximum allowed subword length under greedy encoding. Both oversegmentation (small resolution) and undersegmentation (large resolution) degrade performance, with an optimal region emerging at intermediate r… view at source ↗

read the original abstract

Masking-based post-hoc explanation methods, such as KernelSHAP and LIME, estimate local feature importance by querying a black-box model under randomized perturbations. This paper formulates this procedure as communication over a query channel, where the latent explanation acts as a message and each masked evaluation is a channel use. Within this framework, the complexity of the explanation is captured by the entropy of the hypothesis class, while the query interface supplies information at a rate determined by an identification capacity per query. We derive a strong converse showing that, if the explanation rate exceeds this capacity, the probability of exact recovery necessarily converges to one in error for any sequence of explainers and decoders. We also prove an achievability result establishing that a sparse maximum-likelihood decoder attains reliable recovery when the rate lies below capacity. A Monte Carlo estimator of mutual information yields a non-asymptotic query benchmark that we use to compare optimal decoding with Lasso- and OLS-based procedures that mirror LIME and KernelSHAP. Experiments reveal a range of query budgets where information theory permits reliable explanations but standard convex surrogates still fail. Finally, we interpret super-pixel resolution and tokenization for neural language models as a source-coding choice that sets the entropy of the explanation and show how Gaussian noise and nonlinear curvature degrade the query channel, induce waterfall and error-floor behavior, and render high-resolution explanations unattainable.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper models masking explanations as transmission over a query channel and derives capacity bounds plus Monte Carlo benchmarks, but the claimed uniform strong converse may not survive model dependence.

read the letter

This paper treats the masking queries in methods like LIME and KernelSHAP as uses of a communication channel whose capacity limits how complex an explanation you can recover reliably. The main new piece is the application of strong converse theorems to show that exceeding the rate (hypothesis entropy over queries) forces error probability to one, plus an achievability result for a sparse ML decoder below capacity. They also supply a Monte Carlo mutual-information estimator as a non-asymptotic benchmark and run it against Lasso and OLS surrogates that mimic the standard methods. Experiments indicate regimes where the information-theoretic limit allows recovery but the convex proxies still fail, and they tie super-pixel or token choices to source-coding rate limits while noting how noise and curvature create error floors. That framing is fresh and gives a clean way to think about why high-resolution explanations are hard in practice. The soft spot is the uniformity assumption: capacity is induced by the unknown black-box response to masks, so it varies with the model. If some black-boxes make the channel more informative, the threshold is not a universal ceiling that applies to every explainer, which undercuts the strong converse claim for arbitrary cases. The abstract states the proofs exist but the provided text gives no channel definitions or derivation steps, so verification is impossible here. The Monte Carlo setup also lacks enough detail on black-box choices and sampling to judge robustness. This is for readers already working at the intersection of information theory and XAI who want bounds instead of new heuristics. A practitioner looking for immediate fixes will find little; someone wanting to organize evaluation of explanation methods will see value. The work is coherent enough on its own terms to deserve referee time, even with the open questions on assumptions and proofs.

Referee Report

2 major / 2 minor

Summary. The paper models masking-based post-hoc explanation methods such as LIME and KernelSHAP as communication over a query channel, where the latent explanation is the message and each randomized masked evaluation of the black-box is a channel use. Explanation complexity is measured by the entropy of the hypothesis class, and the query interface provides information at an identification capacity per query. The central results are a strong converse showing that if the explanation rate (entropy divided by number of queries) exceeds this capacity then the probability of exact recovery converges to 1 for any sequence of explainers and decoders, plus an achievability result that a sparse maximum-likelihood decoder succeeds reliably below capacity. A Monte Carlo mutual-information estimator supplies a non-asymptotic benchmark used to compare optimal decoding against Lasso- and OLS-based procedures; experiments identify query-budget regimes where information theory permits reliable recovery but the convex surrogates fail. The work also interprets super-pixel resolution and tokenization as source-coding choices and analyzes how Gaussian noise and nonlinear curvature degrade the channel.

Significance. If the results hold, the paper supplies the first information-theoretic characterization of fundamental query limits for masking-based explanations, which could guide the design of query-efficient explainers and clarify when high-resolution explanations are information-theoretically impossible. The Monte Carlo benchmark and direct comparisons to LIME/KernelSHAP-style procedures are practical strengths, as is the explicit treatment of resolution choices as source coding. The derivation of both a strong converse and an achievability result, together with the reproducible Monte Carlo estimator, constitutes a solid technical contribution.

major comments (2)

[Abstract] Abstract and model definition: the strong converse is stated to apply uniformly 'for any sequence of explainers and decoders' and to constrain all practical masking explainers, yet the channel law P(black-box output | mask) is induced by the unknown black-box function itself. It is therefore unclear whether the resulting identification capacity remains bounded independently of the black-box or can be made arbitrarily large by suitable choice of f, which would prevent the rate threshold from serving as a universal limit. This assumption is load-bearing for the universality claim.
[Abstract] Abstract: the manuscript asserts derivation of a 'strong converse' and an 'achievability result' together with Monte Carlo experiments, but the provided text does not include the full channel definitions, the precise statement of the capacity, or the proof sketches. Without these, the central claims cannot be verified at the level required for a journal.

minor comments (2)

Notation for the query channel, identification capacity, and explanation rate should be introduced with explicit equations in the main text rather than relying on the abstract.
The Monte Carlo estimator of mutual information is described only at a high level; adding pseudocode or implementation details would improve reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the thoughtful and constructive review. We address each major comment below and indicate the revisions planned for the next version of the manuscript.

read point-by-point responses

Referee: [Abstract] Abstract and model definition: the strong converse is stated to apply uniformly 'for any sequence of explainers and decoders' and to constrain all practical masking explainers, yet the channel law P(black-box output | mask) is induced by the unknown black-box function itself. It is therefore unclear whether the resulting identification capacity remains bounded independently of the black-box or can be made arbitrarily large by suitable choice of f, which would prevent the rate threshold from serving as a universal limit. This assumption is load-bearing for the universality claim.

Authors: The identification capacity is defined for the specific channel law induced by the fixed but arbitrary black-box f. The strong converse establishes that, for any such f, if the explanation rate exceeds the capacity of the induced query channel, then the probability of exact recovery converges to zero for every sequence of explainers and decoders. The result is therefore universal with respect to the choice of explainer and decoder, while the numerical value of the capacity naturally depends on f. This dependence is expected, as different black-box functions convey different amounts of information per masked query. We do not assert a single numerical bound independent of f. To remove any ambiguity, we will revise the abstract and the model section to state explicitly that the channel law and capacity are induced by f and to emphasize that the converse applies uniformly over explainers for any given f. revision: yes
Referee: [Abstract] Abstract: the manuscript asserts derivation of a 'strong converse' and an 'achievability result' together with Monte Carlo experiments, but the provided text does not include the full channel definitions, the precise statement of the capacity, or the proof sketches. Without these, the central claims cannot be verified at the level required for a journal.

Authors: The full manuscript defines the query channel and the induced channel law in Section 2, states the identification capacity, presents the strong converse and achievability theorems in Section 3 with proof sketches, and supplies complete proofs in the appendix. The Monte Carlo mutual-information estimator and its use as a benchmark are detailed in Section 4. Nevertheless, we agree that the abstract is highly condensed. We will expand the abstract to include a concise statement of the channel model and capacity, and we will ensure that the introduction highlights the theorem statements and proof outlines for easier verification. revision: yes

Circularity Check

0 steps flagged

No circularity: standard channel coding theorems applied to query channel model

full rationale

The paper models masking-based explanations as transmission over a query channel, with explanation rate defined as hypothesis-class entropy divided by number of queries and capacity defined via mutual information between latent explanation and black-box outputs under masks. The strong converse (error probability to 1 above capacity) and achievability (reliable recovery below capacity via sparse ML decoder) are obtained by direct application of classical information-theoretic results on identification capacity; these theorems are external and do not reduce to any fitted parameter, self-definition, or ansatz within the paper. The Monte Carlo MI estimator is used solely for non-asymptotic benchmarking against LIME/KernelSHAP surrogates and plays no role in the asymptotic derivation. No load-bearing self-citations, uniqueness theorems imported from the authors, or renaming of known results are present. The derivation chain is therefore self-contained and independent of its own inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 1 invented entities

The central claim rests on modeling masking explanations as a memoryless query channel whose capacity is given by identification rate, plus standard information-theoretic theorems; no explicit free parameters are fitted to data in the theoretical claims.

axioms (2)

standard math Strong converse theorem for discrete memoryless channels
Invoked to prove that rate above capacity forces error probability to 1.
domain assumption Achievability of reliable recovery below capacity with sparse ML decoder
Assumes the hypothesis class and query responses allow the decoder to attain the rate.

invented entities (1)

Query channel no independent evidence
purpose: Abstract model of information transfer from randomized masked evaluations to explanation recovery
New modeling abstraction introduced to apply channel coding tools to post-hoc explanations.

pith-pipeline@v0.9.0 · 5545 in / 1617 out tokens · 40126 ms · 2026-05-10T08:06:36.153392+00:00 · methodology

The Query Channel: Information-Theoretic Limits of Masking-Based Explanations

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)