Everywhere Valid Bounds on False Discovery Proportions in Conformal Inference
Pith reviewed 2026-05-21 02:44 UTC · model grok-4.3
The pith
Finite-sample bounds on the false discovery proportion hold simultaneously for all rejection thresholds in conformal inference.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper establishes finite-sample, distribution-free upper bounds on the FDP that hold simultaneously over all possible rejection thresholds, enabling arbitrary post hoc selection of the threshold. Simultaneous validity is achieved by constructing a high-probability envelope for the empirical distribution function of null conformal p-values by sampling from their joint distribution. The framework allows practitioners to modulate the envelope's shape, thereby producing tight bounds in rejection regions of primary interest, and applies this to derive simultaneous FDP upper bounds for both outlier detection and conformal selection.
What carries the argument
High-probability envelope for the empirical distribution function of null conformal p-values, built by sampling from their joint distribution.
If this is right
- The bounds support arbitrary post hoc selection of the rejection threshold while preserving statistical validity.
- The same envelope construction yields valid bounds for outlier detection and for conformal selection.
- Modulating the envelope shape produces tighter bounds in the rejection regions of primary practical interest.
- Synthetic and real-data experiments confirm that the bounds are valid yet substantially less conservative than existing methods.
Where Pith is reading between the lines
- The simultaneous validity could support more flexible exploratory analyses in settings where thresholds must be chosen after seeing preliminary results.
- The sampling-based envelope might extend to other post-selection problems that involve data-dependent choices beyond standard conformal p-values.
- In applied work the reduced conservatism could improve the power of selection procedures without sacrificing coverage.
Load-bearing premise
The construction requires the ability to sample from the joint distribution of the null conformal p-values.
What would settle it
Repeated simulations in which the observed FDP for a post-hoc chosen threshold exceeds the reported bound with frequency greater than the nominal error probability, or where the envelope fails to cover the realized null p-value distribution.
Figures
read the original abstract
Modern applications of conformal inference to multiple testing problems, such as outlier detection and candidate selection, often involve selecting test samples whose conformal p-values fall below a threshold. The quality of such methods is often measured by the false discovery proportion (FDP), defined as the fraction of incorrect selections. Existing approaches typically control the expected value of the FDP, using methods such as the Benjamini-Hochberg procedure. This approach fails to provide high-probability bounds on the realized false discovery proportion and invalidates statistical guarantees if the rejection threshold is selected after inspecting the data. This paper establishes finite-sample, distribution-free upper bounds on the FDP that hold simultaneously over all possible rejection thresholds, enabling arbitrary post hoc selection of the threshold. Simultaneous validity is achieved by constructing a high-probability envelope for the empirical distribution function of null conformal p-values by sampling from their joint distribution. Furthermore, our framework allows practitioners to modulate the envelope's shape, thereby producing tight bounds in rejection regions of primary interest. We use this flexible approach to derive simultaneous FDP upper bounds for both outlier detection and conformal selection. We demonstrate through synthetic and real-data experiments that the resulting bounds are both valid and substantially less conservative than those derived from existing approaches.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to establish finite-sample, distribution-free upper bounds on the false discovery proportion (FDP) that hold simultaneously over all possible rejection thresholds in conformal inference. This is achieved by constructing a high-probability envelope on the empirical distribution function of null conformal p-values through exact sampling from their joint distribution (which is uniform and rank-based, hence distribution-free). The framework further permits modulating the envelope shape to tighten bounds in regions of interest and is applied to derive simultaneous FDP bounds for outlier detection and conformal selection, with supporting synthetic and real-data experiments.
Significance. If the central construction holds, the work is significant for providing high-probability (rather than expectation-only) control on realized FDP while preserving validity under arbitrary post-hoc threshold choice. This directly addresses a practical limitation of procedures like Benjamini-Hochberg in conformal multiple-testing settings and leverages the exact samplability of conformal null p-values to obtain non-asymptotic, distribution-free guarantees.
minor comments (3)
- Clarify in the main text (near the envelope construction) whether the modulation of envelope shape is performed in a data-independent manner or if any data-dependent tuning is used; the coverage statement must remain unaffected.
- In the experimental section, report the exact number of Monte Carlo samples used for envelope construction and include a sensitivity check showing that the reported bounds stabilize with increasing sample size.
- Add a brief remark on computational cost of the sampling procedure relative to standard conformal p-value computation, especially for large calibration sets.
Simulated Author's Rebuttal
We thank the referee for their positive review and accurate summary of our manuscript. We appreciate the recognition of the significance of our finite-sample, distribution-free simultaneous bounds on the FDP and the recommendation for minor revision. Since the report does not list any specific major comments, we have no points to address point-by-point at this stage. We will incorporate any minor suggestions from the editor or further review in the revised manuscript.
Circularity Check
No significant circularity detected
full rationale
The derivation constructs simultaneous finite-sample distribution-free upper bounds on the FDP by building a high-probability envelope around the ECDF of null conformal p-values. This envelope is obtained by direct Monte Carlo sampling from the exact joint distribution of those p-values, which is known to be uniform and distribution-free under the null due to the rank-based definition of conformal p-values. The sampling step uses only the known null properties and does not depend on fitted parameters from the observed data, post-hoc threshold selection, or any self-referential quantities. No load-bearing self-citations, ansatzes smuggled via prior work, or reductions of predictions to fitted inputs appear in the central argument; the construction remains self-contained against the external benchmark of conformal p-value uniformity.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Null conformal p-values admit sampling from their joint distribution under the null.
Reference graph
Works this paper leans on
-
[1]
International Conference on Artificial Intelligence and Statistics , pages=
Transductive conformal inference with adaptive scores , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2024 , organization=
work page 2024
-
[2]
Proceedings of the 23rd international conference on Machine learning , pages=
The relationship between Precision-Recall and ROC curves , author=. Proceedings of the 23rd international conference on Machine learning , pages=
-
[3]
Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=
Simultaneous false discovery proportion bounds via knockoffs and closed testing , author=. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=. 2024 , publisher=
work page 2024
-
[4]
Continuous Multivariate Distributions, Volume 1: Models and Applications , author=. 2000 , publisher=
work page 2000
-
[5]
The annals of Probability , pages=
The tight constant in the Dvoretzky-Kiefer-Wolfowitz inequality , author=. The annals of Probability , pages=. 1990 , publisher=
work page 1990
-
[6]
Large-scale inference: empirical Bayes methods for estimation, testing, and prediction , author=. 2012 , publisher=
work page 2012
-
[7]
Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation,
Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , author=. arXiv preprint arXiv:2010.16061 , year=
-
[8]
Advances in neural information processing systems , volume=
Selective classification for deep neural networks , author=. Advances in neural information processing systems , volume=
-
[9]
arXiv preprint arXiv:2602.10018 , year=
Online Selective Conformal Prediction with Asymmetric Rules: A Permutation Test Approach , author=. arXiv preprint arXiv:2602.10018 , year=
-
[10]
Higher criticism: p-values and criticism , author=
-
[11]
Advances in neural information processing systems , volume=
Support vector method for novelty detection , author=. Advances in neural information processing systems , volume=
-
[12]
A central limit theorem for the Benjamini-Hochberg false discovery proportion under a factor model , author=. Bernoulli , volume=. 2024 , publisher=
work page 2024
-
[13]
The Annals of Statistics , volume=
Adaptive novelty detection with false discovery rate guarantee , author=. The Annals of Statistics , volume=. 2024 , publisher=
work page 2024
-
[14]
Nature biotechnology , volume=
Comprehensive analysis of kinase inhibitor selectivity , author=. Nature biotechnology , volume=. 2011 , publisher=
work page 2011
-
[15]
A review of novelty detection , author=. Signal processing , volume=. 2014 , publisher=
work page 2014
-
[16]
Algorithmic learning in a random world , author=. 2005 , publisher=
work page 2005
-
[17]
Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=
Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach , author=. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=. 2004 , publisher=
work page 2004
-
[18]
Higher criticism for detecting sparse heterogeneous mixtures , author=
-
[19]
Journal of Probability and Statistics , volume=
Control of the false discovery proportion for independently tested null hypotheses , author=. Journal of Probability and Statistics , volume=. 2012 , publisher=
work page 2012
-
[20]
The Annals of Mathematical Statistics , pages=
Asymptotic minimax character of the sample distribution function and of the classical multinomial estimator , author=. The Annals of Mathematical Statistics , pages=. 1956 , publisher=
work page 1956
-
[21]
Advances in neural information processing systems , volume=
Conformal prediction under covariate shift , author=. Advances in neural information processing systems , volume=
-
[22]
ACM Transactions on Mathematical Software (TOMS) , volume=
Parallel weighted random sampling , author=. ACM Transactions on Mathematical Software (TOMS) , volume=. 2022 , publisher=
work page 2022
-
[23]
Brazilian Journal of Probability and Statistics , volume=
Weighted sampling without replacement , author=. Brazilian Journal of Probability and Statistics , volume=. 2018 , publisher=
work page 2018
-
[24]
Rate of convergence of empirical measures for exchangeable sequences , author=. Mathematica Slovaca , volume=. 2017 , publisher=
work page 2017
-
[25]
arXiv preprint arXiv:2208.06685 , year=
Machine learning meets false discovery rate , author=. arXiv preprint arXiv:2208.06685 , year=
-
[26]
arXiv preprint arXiv:2010.09686 , year=
Estimating means of bounded random variables by betting , author=. arXiv preprint arXiv:2010.09686 , year=
-
[27]
The Annals of Statistics , volume=
Testing for outliers with conformal p-values , author=. The Annals of Statistics , volume=. 2023 , publisher=
work page 2023
-
[28]
Journal of Machine Learning Research , volume=
Selection by prediction with conformal p-values , author=. Journal of Machine Learning Research , volume=
-
[29]
arXiv preprint arXiv:2307.09291 , year=
Model-free selective inference under covariate shift via weighted conformal p-values , author=. arXiv preprint arXiv:2307.09291 , year=
-
[30]
Goeman and Aldo Solari , title =
Jelle J. Goeman and Aldo Solari , title =. Statistical Science , number =. 2011 , doi =
work page 2011
-
[31]
Journal of the American Statistical Association , volume=
Exceedance control of the false discovery proportion , author=. Journal of the American Statistical Association , volume=. 2006 , publisher=
work page 2006
-
[32]
Simultaneous control of all false discovery proportions in large-scale multiple hypothesis testing , author=. Biometrika , volume=. 2019 , publisher=
work page 2019
-
[33]
Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=
Permutation-based true discovery guarantee by sum tests , author=. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=. 2023 , publisher=
work page 2023
-
[34]
arXiv preprint arXiv:2212.12822 , year=
Simultaneous false discovery proportion bounds via knockoffs and closed testing , author=. arXiv preprint arXiv:2212.12822 , year=
-
[35]
The Annals of Statistics , number =
Christopher Genovese and Larry Wasserman , title =. The Annals of Statistics , number =. 2004 , doi =
work page 2004
-
[36]
Scandinavian Journal of Statistics , volume=
False discovery control for multiple tests of association under general dependence , author=. Scandinavian Journal of Statistics , volume=. 2006 , publisher=
work page 2006
-
[37]
Permutation-based simultaneous confidence bounds for the false discovery proportion , author=. Biometrika , volume=. 2019 , publisher=
work page 2019
-
[38]
Goeman and Jesse Hemerik and Aldo Solari , title =
Jelle J. Goeman and Jesse Hemerik and Aldo Solari , title =. The Annals of Statistics , number =. 2021 , doi =
work page 2021
-
[39]
The Annals of Statistics , number =
Eugene Katsevich and Aaditya Ramdas , title =. The Annals of Statistics , number =. 2020 , doi =
work page 2020
-
[40]
The Annals of Statistics , number =
Gilles Blanchard and Pierre Neuvial and Etienne Roquain , title =. The Annals of Statistics , number =. 2020 , doi =
work page 2020
-
[41]
Information and Inference: A Journal of the IMA , volume=
The limits of distribution-free conditional predictive inference , author=. Information and Inference: A Journal of the IMA , volume=. 2021 , publisher=
work page 2021
-
[42]
Journal of the Royal statistical society: series B (Methodological) , volume=
Controlling the false discovery rate: a practical and powerful approach to multiple testing , author=. Journal of the Royal statistical society: series B (Methodological) , volume=. 1995 , publisher=
work page 1995
-
[43]
arXiv preprint arXiv:2411.17983 , year=
Optimized conformal selection: Powerful selective inference after conformity score optimization , author=. arXiv preprint arXiv:2411.17983 , year=
-
[44]
arXiv preprint arXiv:2506.16229 , year=
Diversifying Conformal Selections , author=. arXiv preprint arXiv:2506.16229 , year=
-
[45]
arXiv preprint arXiv:2507.15825 , year=
ACS: An interactive framework for conformal selection , author=. arXiv preprint arXiv:2507.15825 , year=
-
[46]
TxConformal: Controlling False Discoveries in AI-Driven Therapeutic Discovery , author=. bioRxiv , pages=. 2026 , publisher=
work page 2026
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.