RRISE: Robust Radius Inference via a Surrogate Estimator
Pith reviewed 2026-06-28 15:22 UTC · model grok-4.3
The pith
RRISE compresses randomized smoothing certification into one surrogate forward pass while preserving provably conservative radii.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
RRISE trains a surrogate against precomputed Monte Carlo class-count targets via soft-label cross-entropy loss, then applies one-time conformal calibration to turn the surrogate outputs into provably conservative certified radii. Whenever the calibrated radius is positive, the surrogate prediction matches the smoothed classifier and the smoothed classifier is constant on a ball of that radius around the input.
What carries the argument
Surrogate estimator trained on Monte Carlo class-count targets, followed by conformal calibration to produce deployment-verifiable conservative radii.
If this is right
- Matches fixed-budget Monte Carlo certified accuracy within 0.84 percentage points across image classification benchmarks.
- Replaces up to 10,000 noisy base-model evaluations per query with a single surrogate forward pass.
- Recoups the Monte Carlo training cost after approximately 100,000 deployment queries.
- Achieves 1.23 to 1.91 times higher certified accuracy than the prior offline-surrogate method on CIFAR-100 and Tiny ImageNet.
- Produces certificates that are deployment-verifiable whenever the radius is positive.
Where Pith is reading between the lines
- The amortized cost reduction could allow certified robustness to be applied in high-query-volume settings such as online services without dominating latency.
- Similar surrogate training on sampling-based certificates might be tested in non-vision domains where Monte Carlo costs are also high.
- The one-time calibration step could be revisited periodically if the base model is fine-tuned after deployment.
- Combining the surrogate with adaptive sampling budgets might further reduce the initial training data collection cost.
Load-bearing premise
The surrogate trained on precomputed class-count targets and then conformally calibrated produces radii that remain conservative and match the smoothed classifier's behavior on the ball.
What would settle it
Finding an input where the surrogate outputs a positive calibrated radius but the actual smoothed classifier changes its label inside that radius.
Figures
read the original abstract
Randomized smoothing (RS) uses a smoothed classifier to provide architecture-agnostic certificates of $\ell_2$ classification robustness, but its dependence on per-input Monte Carlo (MC) sampling undermines its use in real-time systems. We argue that this cost is structural rather than fundamental, such that it can be significantly reduced by sharing information across the deployment stream. We introduce RRISE, an RS framework that compresses certification into a single forward pass through a learned surrogate. RRISE trains the surrogate against precomputed MC class-count targets via a soft-label cross-entropy loss and converts surrogate predictions into provably conservative certified radii through a one-time conformal calibration step. The resulting certificate is deployment-verifiable: whenever the calibrated radius is positive, the surrogate's prediction provably matches the smoothed classifier's and the smoothed classifier is constant on a ball of that radius around the input. Across image classification benchmarks, RRISE matches fixed-budget MC certified accuracy within $0.84$ percentage points while replacing up to $10^4$ noisy base-model evaluations per query with a single surrogate forward pass, recouping MC training cost after $\approx 10^5$ deployment queries. On CIFAR-100 and Tiny ImageNet, where the only prior offline-surrogate method collapses, RRISE achieves $1.23$ to $1.91\times$ higher certified accuracy, establishing efficient randomized smoothing as a practical path to certified robustness in repeated-deployment settings.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces RRISE, a randomized smoothing framework that trains a surrogate model on precomputed Monte Carlo class-count targets using soft-label cross-entropy loss. A one-time conformal calibration step then converts surrogate outputs into provably conservative certified ℓ₂ radii. The resulting certificates are deployment-verifiable: a positive radius guarantees that the surrogate prediction matches the smoothed classifier and that the smoothed classifier is constant on the corresponding ball. Empirically, RRISE matches fixed-budget MC certified accuracy within 0.84 percentage points while replacing up to 10⁴ base-model evaluations per query with one surrogate forward pass, with 1.23–1.91× higher certified accuracy than the prior offline-surrogate method on CIFAR-100 and Tiny ImageNet.
Significance. If the central claims hold, RRISE demonstrates that the per-query Monte Carlo cost of randomized smoothing is amortizable rather than fundamental, making certified robustness practical for repeated-deployment settings after roughly 10⁵ queries. The use of precomputed MC targets plus standard conformal calibration, together with the explicit deployment-verifiable guarantee, provides a clean separation between training and certification that prior surrogate approaches lacked. The reported gains on datasets where the only competing offline method collapses are a concrete strength.
minor comments (3)
- [§3.2] §3.2: the precise form of the soft-label cross-entropy loss (including temperature or label-smoothing parameters) is not stated explicitly; adding the equation would remove ambiguity about how the surrogate is optimized against the MC targets.
- [§4.3] §4.3, Table 2: the conformal calibration quantile is reported only as a single value per dataset; stating the exact α used and confirming it was chosen on a held-out calibration set (rather than tuned on test data) would strengthen the reproducibility claim.
- [Figure 3] Figure 3: the y-axis label 'Certified Accuracy' should specify whether it is measured against the base classifier or the smoothed classifier, and the error bars should be described in the caption.
Simulated Author's Rebuttal
We thank the referee for the positive summary, significance assessment, and recommendation of minor revision. The report contains no major comments requiring point-by-point rebuttal.
Circularity Check
No significant circularity
full rationale
The derivation trains a surrogate on precomputed Monte Carlo class-count targets using standard soft-label cross-entropy, then applies one-time conformal calibration to obtain conservative radii. Both the training targets and the calibration step are external to the final radius output and do not reduce the certified-radius claim to a fitted quantity by construction. No self-definitional equations, fitted-input-as-prediction, or load-bearing self-citations appear in the provided abstract or high-level argument. The deployment-verifiable guarantee follows from standard conformal properties rather than an internal redefinition.
Axiom & Free-Parameter Ledger
free parameters (2)
- surrogate model parameters
- conformal calibration quantile
axioms (1)
- domain assumption The smoothed classifier is constant on a ball of the certified radius around the input when the surrogate prediction matches the MC estimate after calibration.
Reference graph
Works this paper leans on
-
[1]
International Conference on Machine Learning , pages=
On calibration of modern neural networks , author=. International Conference on Machine Learning , pages=. 2017 , organization=
2017
-
[2]
International Conference on Machine Learning , pages=
Dropout as a Bayesian approximation: Representing model uncertainty in deep learning , author=. International Conference on Machine Learning , pages=. 2016 , organization=
2016
-
[3]
Advances in Neural Information Processing Systems , volume=
Simple and scalable predictive uncertainty estimation using deep ensembles , author=. Advances in Neural Information Processing Systems , volume=
-
[4]
Advances in Neural Information Processing Systems , volume=
A simple baseline for Bayesian uncertainty in deep learning , author=. Advances in Neural Information Processing Systems , volume=
-
[5]
Journal of Machine Learning Research , volume=
A simple approach to improve single-model deep uncertainty via distance-awareness , author=. Journal of Machine Learning Research , volume=
-
[6]
Advances in Neural Information Processing Systems , volume=
Energy-based out-of-distribution detection , author=. Advances in Neural Information Processing Systems , volume=
-
[7]
Advances in Neural Information Processing Systems , volume=
Classification with valid and adaptive coverage , author=. Advances in Neural Information Processing Systems , volume=
-
[8]
arXiv preprint arXiv:2009.14193 , year=
Uncertainty sets for image classifiers using conformal prediction , author=. arXiv preprint arXiv:2009.14193 , year=
arXiv 2009
-
[9]
Advances in Neural Information Processing Systems , volume=
Training uncertainty-aware classifiers with conformalized deep learning , author=. Advances in Neural Information Processing Systems , volume=
-
[10]
ACM Computing Surveys , year=
Conformal prediction: A data perspective , author=. ACM Computing Surveys , year=
-
[11]
arXiv preprint arXiv:2305.15508 , year=
How to fix a broken confidence estimator: Evaluating post-hoc methods for selective classification with deep neural networks , author=. arXiv preprint arXiv:2305.15508 , year=
-
[12]
Advances in Neural Information Processing Systems , volume=
Evidential deep learning to quantify classification uncertainty , author=. Advances in Neural Information Processing Systems , volume=
-
[13]
International Conference on Learning Representations , year=
Enhancing The Reliability of Out-of-distribution Image Detection in Neural Networks , author=. International Conference on Learning Representations , year=
-
[14]
Advances in Neural Information Processing Systems , volume=
Are uncertainty quantification capabilities of evidential deep learning a mirage? , author=. Advances in Neural Information Processing Systems , volume=
-
[15]
Advances in Neural Information Processing Systems , volume=
Formal guarantees on the robustness of a classifier against adversarial manipulation , author=. Advances in Neural Information Processing Systems , volume=
-
[16]
International Conference on Machine Learning , pages=
Certified adversarial robustness via randomized smoothing , author=. International Conference on Machine Learning , pages=. 2019 , organization=
2019
-
[17]
2019 IEEE Symposium on Security and Privacy (SP) , pages=
Certified robustness to adversarial examples with differential privacy , author=. 2019 IEEE Symposium on Security and Privacy (SP) , pages=. 2019 , organization=
2019
-
[18]
arXiv preprint arXiv:2404.17371 , year=
Estimating the Robustness Radius for Randomized Smoothing with 100x Sample Efficiency , author=. arXiv preprint arXiv:2404.17371 , year=
-
[19]
Proceedings of the AAAI Conference on Artificial Intelligence , volume=
Input-specific robustness certification for randomized smoothing , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=
-
[20]
Advances in Neural Information Processing Systems , volume=
Treatment of statistical estimation problems in randomized smoothing for adversarial robustness , author=. Advances in Neural Information Processing Systems , volume=
-
[21]
Journal of the American Statistical Association , volume=
Strictly proper scoring rules, prediction, and estimation , author=. Journal of the American Statistical Association , volume=. 2007 , publisher=
2007
-
[22]
Advances in Neural Information Processing Systems , volume=
Conformalized quantile regression , author=. Advances in Neural Information Processing Systems , volume=
-
[23]
The Twelfth International Conference on Learning Representations , year=
Incremental Randomized Smoothing Certification , author=. The Twelfth International Conference on Learning Representations , year=
-
[24]
Proceedings of the AAAI Conference on Artificial Intelligence , volume=
Towards large certified radius in randomized smoothing using quasiconcave optimization , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=
-
[25]
Advances in Neural Information Processing Systems , volume=
Provably robust deep learning via adversarially trained smoothed classifiers , author=. Advances in Neural Information Processing Systems , volume=
-
[26]
Advances in Neural Information Processing Systems , volume=
Certified defense to image transformations via randomized smoothing , author=. Advances in Neural Information Processing Systems , volume=
-
[27]
International Conference on Machine Learning , pages=
Randomized smoothing of all shapes and sizes , author=. International Conference on Machine Learning , pages=. 2020 , organization=
2020
-
[28]
arXiv preprint arXiv:2402.07498 , year=
Accelerated Smoothing: A Scalable Approach to Randomized Smoothing , author=. arXiv preprint arXiv:2402.07498 , year=
-
[29]
Proceedings of the IEEE Conference on Computer Cision and Pattern Recognition , pages=
Robust physical-world attacks on deep learning visual classification , author=. Proceedings of the IEEE Conference on Computer Cision and Pattern Recognition , pages=
-
[30]
Pattern Recognition , volume=
Understanding adversarial attacks on deep learning based medical image analysis systems , author=. Pattern Recognition , volume=. 2021 , publisher=
2021
-
[31]
Xu, Xuwei and Wang, Sen and Chen, Yudong and Zheng, Yanping and Wei, Zhewei and Liu, Jiajun , booktitle=
-
[32]
Advances in Neural Information Processing Systems , volume=
Certified adversarial robustness with additive noise , author=. Advances in Neural Information Processing Systems , volume=
-
[33]
Transactions on Machine Learning Research , year=
Projected randomized smoothing for certified adversarial robustness , author=. Transactions on Machine Learning Research , year=
-
[34]
Uncertainty in Artificial Intelligence , pages=
Data dependent randomized smoothing , author=. Uncertainty in Artificial Intelligence , pages=. 2022 , organization=
2022
-
[35]
Proceedings of the ACM on Programming Languages , volume=
An abstract domain for certifying neural networks , author=. Proceedings of the ACM on Programming Languages , volume=. 2019 , publisher=
2019
-
[36]
International Conference on Machine Learning , pages=
Towards fast computation of certified robustness for relu networks , author=. International Conference on Machine Learning , pages=. 2018 , organization=
2018
-
[37]
2022 , organization=
Kong, Zhenglun and Dong, Peiyan and Ma, Xiaolong and Meng, Xin and Niu, Wei and Sun, Mengshu and Shen, Xuan and Yuan, Geng and Ren, Bin and Tang, Hao and others , booktitle=. 2022 , organization=
2022
-
[38]
Advances in Neural Information Processing Systems , volume=
Efficient formal safety analysis of neural networks , author=. Advances in Neural Information Processing Systems , volume=
-
[39]
Conference on Robot Learning , pages=
Robust trajectory prediction against adversarial attacks , author=. Conference on Robot Learning , pages=. 2023 , organization=
2023
-
[40]
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pages=
Real-time, universal, and robust adversarial attacks against speaker recognition systems , author=. ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pages=. 2020 , organization=
2020
-
[41]
IEEE Transactions on Intelligent Vehicles , year=
Adversarial attacks on autonomous driving systems in the physical world: a survey , author=. IEEE Transactions on Intelligent Vehicles , year=
-
[42]
Machine learning , volume=
Analysis of classifiers’ robustness to adversarial perturbations , author=. Machine learning , volume=. 2018 , publisher=
2018
-
[43]
, author=
A tutorial on conformal prediction. , author=. Journal of Machine Learning Research , volume=
-
[44]
Clustering with
Banerjee, Arindam and Merugu, Srujana and Dhillon, Inderjit S and Ghosh, Joydeep , journal=. Clustering with
-
[45]
Inequalities for the
Weissman, Tsachy and Ordentlich, Erik and Seroussi, Gadiel and Verdu, Sergio and Weinberger, Marcelo J , journal=. Inequalities for the
-
[46]
Science , volume=
Adversarial attacks on medical machine learning , author=. Science , volume=. 2019 , publisher=
2019
-
[47]
Advances in Neural Information Processing Systems , volume=
Selective classification for deep neural networks , author=. Advances in Neural Information Processing Systems , volume=
-
[48]
Proceedings of the European Conference on Computer Vision (ECCV) , pages=
Is robustness the cost of accuracy?--a comprehensive study on the robustness of 18 deep image classification models , author=. Proceedings of the European Conference on Computer Vision (ECCV) , pages=
-
[49]
International Journal of Computer Vision , volume=
A comprehensive study on robustness of image classification models: Benchmarking and rethinking , author=. International Journal of Computer Vision , volume=. 2025 , publisher=
2025
-
[50]
Nature Reviews Physics , volume=
Physics-informed machine learning , author=. Nature Reviews Physics , volume=. 2021 , publisher=
2021
-
[51]
arXiv preprint arXiv:2312.12608 , year=
Rethinking Randomized Smoothing from the Perspective of Scalability , author=. arXiv preprint arXiv:2312.12608 , year=
-
[52]
Proceedings of the 10th ACM workshop on artificial intelligence and security , pages=
Adversarial examples are not easily detected: Bypassing ten detection methods , author=. Proceedings of the 10th ACM workshop on artificial intelligence and security , pages=
-
[53]
Foundations and Trends in Machine Learning , volume=
Conformal prediction: A gentle introduction , author=. Foundations and Trends in Machine Learning , volume=. 2023 , publisher=
2023
-
[54]
IEEE Access , volume=
Threat of adversarial attacks on deep learning in computer vision: A survey , author=. IEEE Access , volume=. 2018 , publisher=
2018
-
[55]
Journal of the American Statistical Association , volume=
Distribution-free predictive inference for regression , author=. Journal of the American Statistical Association , volume=. 2018 , publisher=
2018
-
[56]
Biometrika , volume=
The use of confidence or fiducial limits illustrated in the case of the binomial , author=. Biometrika , volume=. 1934 , publisher=
1934
-
[57]
Advances in Neural Information Processing Systems , volume=
On adaptive attacks to adversarial example defenses , author=. Advances in Neural Information Processing Systems , volume=
-
[58]
M. Raissi and P. Perdikaris and G.E. Karniadakis , keywords =. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations , journal =. 2019 , issn =. doi:https://doi.org/10.1016/j.jcp.2018.10.045 , url =
-
[59]
Advances in Neural Information Processing Systems , volume=
Sparse weight activation training , author=. Advances in Neural Information Processing Systems , volume=
-
[60]
Image Sensor for Automotive Use [Product] , year =
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.