Adversarial Contamination Meets Hard Thresholding: An Iterative Algorithm with Signal Adaptivity and Minimax Optimality

Hanming Yang; Shixiang Liu

arxiv: 2606.27685 · v1 · pith:5QRKEETWnew · submitted 2026-06-26 · 📊 stat.ML · cs.LG

Adversarial Contamination Meets Hard Thresholding: An Iterative Algorithm with Signal Adaptivity and Minimax Optimality

Shixiang Liu , Hanming Yang This is my paper

Pith reviewed 2026-06-29 03:37 UTC · model grok-4.3

classification 📊 stat.ML cs.LG

keywords adversarial contaminationiterative hard thresholdinghigh-dimensional regressionrobust statisticssignal adaptivityoracle propertyminimax optimalitysupport recovery

0 comments

The pith

A new iterative hard thresholding algorithm achieves minimax near-optimal estimation in high-dimensional regression under adversarial contamination.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes the AC-IHT algorithm for high-dimensional regression when data may be corrupted by adversarial contamination. The method iteratively refines estimates of both the regression coefficients and the contamination vector, applying different hard thresholding scales to each. This yields estimation accuracy that is near the minimax optimal rate, up to logarithmic factors. When signal conditions are met, the procedure automatically adapts to deliver sharper rates and better support recovery while satisfying the strong oracle property that enables asymptotic inference.

Core claim

The AC-IHT algorithm achieves minimax near-optimal estimation by iteratively updating the coefficient vector and the contamination vector with different thresholding scales. Under proper signal conditions, it adaptively attains a sharper estimation rate and more accurate support recovery. Moreover, it enjoys the strong oracle property.

What carries the argument

The AC-IHT algorithm, which performs iterative updates to the coefficient and contamination vectors using separate hard thresholding operations at different scales.

If this is right

It achieves minimax near-optimal estimation rates up to logarithmic terms.
Under proper signal conditions it attains sharper estimation rates and more accurate support recovery.
It satisfies the strong oracle property for asymptotic inference.
Numerical experiments show superior finite-sample performance.
The procedure extends theoretically to generalized linear models and heavy-tailed noise.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The adaptive behavior may allow the method to be applied in settings where signal strength varies across coordinates without prior knowledge.
Similar separate-thresholding ideas could be tested in other robust estimation problems such as covariance estimation under contamination.
The oracle property opens the door to constructing valid confidence intervals after variable selection in contaminated data.

Load-bearing premise

Proper signal conditions hold that enable the adaptive sharper rates and the strong oracle property.

What would settle it

A high-dimensional regression experiment with adversarial contamination in which the AC-IHT estimator's error rate fails to match the claimed minimax near-optimal bound up to log factors.

Figures

Figures reproduced from arXiv: 2606.27685 by Hanming Yang, Shixiang Liu.

**Figure 2.** Figure 2: The ℓ2 estimation error of the two-stage AC-IHT algorithm with varying s and o under 300 replications. As illustrated in [PITH_FULL_IMAGE:figures/full_fig_p021_2.png] view at source ↗

**Figure 3.** Figure 3: Comparison of asymptotic normality across AC-IHT, IHT- [PITH_FULL_IMAGE:figures/full_fig_p022_3.png] view at source ↗

**Figure 4.** Figure 4: Convergence dynamics of AC-IHT over iterations for different values of [PITH_FULL_IMAGE:figures/full_fig_p023_4.png] view at source ↗

**Figure 5.** Figure 5: Support recovery performance with increasing [PITH_FULL_IMAGE:figures/full_fig_p024_5.png] view at source ↗

**Figure 6.** Figure 6: Feasible region (shaded) and two exemplary threshold update trajectories (solid purple and dashed [PITH_FULL_IMAGE:figures/full_fig_p031_6.png] view at source ↗

read the original abstract

Pervasive data contamination -- stemming from measurement errors, outliers, or adversarial corruption -- has motivated the development of robust statistical methods. In this context, we propose a two-stage Adversarial Contamination-resistant Iterative Hard Thresholding (AC-IHT) algorithm for high-dimensional regression with contamination. Our nonconvex algorithm achieves minimax near-optimal (up to logarithmic terms) estimation by iteratively updating the coefficient vector and the contamination vector with different thresholding scales. We further demonstrate that our AC-IHT estimator is signal-adaptive: under proper signal conditions, it adaptively attains a sharper estimation rate and more accurate support recovery. Moreover, it enjoys the strong oracle property, laying a theoretical foundation for asymptotic inference. Numerical experiments confirm its superior finite-sample performance. Finally, we discuss theoretical extensions of the proposed procedure to generalized linear models and to heavy-tailed noise settings.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

AC-IHT extends IHT to handle adversarial contamination via separate thresholding on coefficients and contamination, with claims of near-minimax rates plus signal adaptivity under specific conditions.

read the letter

This paper introduces AC-IHT, a nonconvex iterative algorithm that updates both the coefficient vector and a contamination vector at different thresholding scales for high-dimensional regression under adversarial corruption. The main new element is this explicit handling of contamination inside the IHT loop, which the authors say delivers minimax near-optimal rates up to logs, plus signal-adaptivity and the strong oracle property when signal conditions hold.

The work does a reasonable job framing a practical extension of existing IHT methods to the robust setting and backing the claims with theory plus some numerical checks. The idea of using different scales for the two vectors is direct and could be a useful tweak for separating signal from corruption.

The soft spots center on the load-bearing 'proper signal conditions' required for adaptivity and the oracle property. If those conditions end up restrictive, say demanding signal strength well above the contamination level, then the sharper rates may not apply in many realistic cases and the method loses ground to simpler robust alternatives. The abstract also leaves open how the nonconvex iterations are guaranteed to converge and whether the rates hold without additional tuning parameters. The mentioned extensions to GLMs and heavy-tailed noise are only sketched.

This is aimed at people working on robust high-dimensional estimation and sparse inference. A reader who cares about theoretical rates and oracle properties in contaminated data would get something from it. It has enough concrete claims to deserve a serious referee rather than a desk reject, so the proofs and condition strength can be checked directly.

Referee Report

1 major / 0 minor

Summary. The paper proposes the Adversarial Contamination-resistant Iterative Hard Thresholding (AC-IHT) algorithm for high-dimensional linear regression under adversarial contamination. It claims that this nonconvex iterative procedure achieves minimax near-optimal estimation rates (up to logarithmic factors) by alternately updating the coefficient vector and contamination vector using distinct hard-thresholding scales. The paper further claims that the estimator is signal-adaptive, attaining sharper rates and more accurate support recovery under suitable signal conditions, and possesses the strong oracle property that supports asymptotic inference. Numerical experiments are presented to illustrate finite-sample performance, with theoretical extensions discussed for generalized linear models and heavy-tailed noise.

Significance. If the central claims on minimax near-optimality, signal-adaptivity, and the strong oracle property are rigorously established with explicit conditions and derivations, the work would offer a computationally attractive robust estimator for contaminated high-dimensional data that combines adaptivity with inferential guarantees, potentially advancing robust statistics in sparse regression settings.

major comments (1)

[Abstract] Abstract: the claims of minimax near-optimality, signal-adaptivity under 'proper signal conditions,' and the strong oracle property are presented without any derivation, assumption statement, or rate expression visible in the provided text. Because these are the load-bearing assertions, the absence of the supporting technical development (e.g., the precise minimum-signal-strength condition relative to contamination level and sparsity, or the iteration analysis) prevents verification that the rates are independently derived rather than tautological.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their review. We address the single major comment below.

read point-by-point responses

Referee: [Abstract] Abstract: the claims of minimax near-optimality, signal-adaptivity under 'proper signal conditions,' and the strong oracle property are presented without any derivation, assumption statement, or rate expression visible in the provided text. Because these are the load-bearing assertions, the absence of the supporting technical development (e.g., the precise minimum-signal-strength condition relative to contamination level and sparsity, or the iteration analysis) prevents verification that the rates are independently derived rather than tautological.

Authors: The abstract is a concise summary of contributions and does not contain technical details by design; all supporting assumptions, derivations, and explicit rates appear in the main text. Minimax near-optimality (up to log factors) is proved in Theorem 1 under Assumptions 1--3 with the rate O((s log(p)/n) + au), where au denotes the adversarial contamination fraction. Signal-adaptivity and the associated minimum-signal-strength condition eta_min \gtrsim au + heta sqrt((log p)/n) are stated and derived in Theorem 2. The strong oracle property is established in Theorem 3. The two-stage iteration analysis, including contraction mapping arguments that guarantee linear convergence, is given in Section 3.2. These results are obtained from first-principles analysis of the non-convex updates and are not tautological. If the referee received only the abstract, the full manuscript supplies the requested technical development. revision: no

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The abstract describes a nonconvex iterative algorithm (AC-IHT) that updates coefficient and contamination vectors with different thresholding scales to achieve minimax near-optimal rates (up to logs), signal-adaptivity under proper signal conditions, and the strong oracle property. No equations, derivations, or parameter-fitting steps are supplied in the visible text. No self-definitional reductions, fitted inputs renamed as predictions, or load-bearing self-citations appear. The derivation chain is therefore treated as self-contained against external benchmarks; the signal conditions are stated as assumptions rather than derived outputs. This is the expected honest non-finding for an abstract-only view.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no explicit details on free parameters, axioms, or invented entities; the contamination vector and thresholding scales are mentioned but their status as fitted or assumed cannot be determined.

pith-pipeline@v0.9.1-grok · 5677 in / 1118 out tokens · 44552 ms · 2026-06-29T03:37:35.885920+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

85 extracted references · 28 canonical work pages · 1 internal anchor

[1]

Optimal Estimation of the Null Distribution in Large-Scale Inference , year=

Kotekal, Subhodh and Gao, Chao , journal=. Optimal Estimation of the Null Distribution in Large-Scale Inference , year=
[2]

Outlier-robust estimation of a sparse linear model using _1 -penalized Huber s M-estimator , url =

Dalalyan, Arnak and Thompson, Philip , booktitle =. Outlier-robust estimation of a sparse linear model using _1 -penalized Huber s M-estimator , url =
[3]

Electronic Journal of Statistics , number =

Geoffrey Chinot , title =. Electronic Journal of Statistics , number =. 2020 , doi =

2020
[4]

SIAM Journal on Mathematics of Data Science , volume =

Minsker, Stanislav and Ndaoud, Mohamed and Wang, Lang , title =. SIAM Journal on Mathematics of Data Science , volume =. 2024 , doi =

2024
[5]

The nonparametric Box–Cox model for high-dimensional regression analysis , journal =

He Zhou and Hui Zou , keywords =. The nonparametric Box–Cox model for high-dimensional regression analysis , journal =. 2024 , issn =. doi:https://doi.org/10.1016/j.jeconom.2023.01.025 , url =

work page doi:10.1016/j.jeconom.2023.01.025 2024
[6]

2024 , eprint=

A Non-Parametric Box-Cox Approach to Robustifying High-Dimensional Linear Hypothesis Testing , author=. 2024 , eprint=

2024
[7]

The Annals of Statistics , number =

Yinan Shen and Jingyang Li and Jian-Feng Cai and Dong Xia , title =. The Annals of Statistics , number =. 2025 , doi =

2025
[8]

Bernoulli , number =

Chao Gao , title =. Bernoulli , number =. 2020 , doi =

2020
[9]

Electronic Journal of Statistics , number =

Mengjie Chen and Chao Gao and Zhao Ren , title =. Electronic Journal of Statistics , number =. 2016 , doi =

2016
[10]

arXiv preprint arXiv:2412.19183 , year=

Outlier-Bias Removal with Alpha Divergence: A Robust Non-Convex Estimator for Linear Regression , author=. arXiv preprint arXiv:2412.19183 , year=

work page arXiv
[11]

Electronic Communications in Probability , number =

Daniel Hsu and Sham Kakade and Tong Zhang , title =. Electronic Communications in Probability , number =. 2012 , doi =

2012
[12]

2019 , publisher=

High-dimensional statistics: A non-asymptotic viewpoint , author=. 2019 , publisher=

2019
[13]

The Annals of Statistics , number =

Mengjie Chen and Chao Gao and Zhao Ren , title =. The Annals of Statistics , number =. 2018 , doi =

2018
[14]

Huber , title =

Peter J. Huber , title =. The Annals of Mathematical Statistics , number =. 1964 , doi =

1964
[15]

and Tseng, P

Sardy, S. and Tseng, P. and Bruce, A. , journal=. Robust wavelet denoising , year=. doi:10.1109/78.923297 , URL=

work page doi:10.1109/78.923297
[16]

Tyrrell Rockafellar , title =

R. Tyrrell Rockafellar , title =. 1997 , pages =

1997
[17]

Wiley Series in Probability and Mathematical Statistics , year=

Robust statistics , author=. Wiley Series in Probability and Mathematical Statistics , year=
[18]

Statistics and Computing , volume=

Robust estimation and wavelet thresholding in partially linear models , author=. Statistics and Computing , volume=. 2007 , publisher=. doi:10.1007/s11222-007-9019-x , URL =

work page doi:10.1007/s11222-007-9019-x 2007
[19]

2007 , doi =

Robust variable selection using least angle regression and elemental set sampling , journal =. 2007 , doi =

2007
[20]

Owen , title =

Yiyuan She and Art B. Owen , title =. Journal of the American Statistical Association , volume =. 2011 , publisher =. doi:10.1198/jasa.2011.tm10390 , URL =

work page doi:10.1198/jasa.2011.tm10390 2011
[21]

MacEachern and Yoonsuh Jung , title =

Yoonkyung Lee and Steven N. MacEachern and Yoonsuh Jung , title =. Statistical Science , number =. 2012 , doi =

2012
[22]

arXiv preprint arXiv:2012.06750 , year=

Outlier-robust sparse/low-rank least-squares regression and robust matrix completion , author=. arXiv preprint arXiv:2012.06750 , year=

work page arXiv 2012
[23]

and Tran, Trac D

Nguyen, Nam H. and Tran, Trac D. , journal=. Robust Lasso With Missing and Grossly Corrupted Observations , year=. doi:10.1109/TIT.2012.2232347 , URL=

work page doi:10.1109/tit.2012.2232347 2012
[24]

Constructive Approximation , volume=

Compressed sensing and matrix completion with constant proportion of corruptions , author=. Constructive Approximation , volume=. 2013 , publisher=. doi:10.1007/s00365-012-9176-9 , URL=

work page doi:10.1007/s00365-012-9176-9 2013
[25]

Corrupted Sensing: Novel Guarantees for Separating Structured Signals , year=

Foygel, Rina and Mackey, Lester , journal=. Corrupted Sensing: Novel Guarantees for Separating Structured Signals , year=. doi:10.1109/TIT.2013.2293654 , URL=

work page doi:10.1109/tit.2013.2293654 2013
[26]

and Narayan, Akil , title =

Adcock, Ben and Bao, Anyi and Jakeman, John D. and Narayan, Akil , title =. SIAM/ASA Journal on Uncertainty Quantification , volume =. 2018 , doi =

2018
[27]

The Annals of Statistics , number =

Guillaume Lecu. The Annals of Statistics , number =. 2020 , doi =

2020
[28]

arXiv preprint arXiv:2103.10420 , year=

Robust-to-outliers square-root LASSO, simultaneous inference with a MOM approach , author=. arXiv preprint arXiv:2103.10420 , year=

work page arXiv
[29]

Consistent Robust Regression , url =

Bhatia, Kush and Jain, Prateek and Kamalaruban, Parameswaran and Kar, Purushottam , booktitle =. Consistent Robust Regression , url =
[30]

Proceedings of the Thirty-Second Conference on Learning Theory , pages =

Adaptive Hard Thresholding for Near-optimal Consistent Robust Regression , author =. Proceedings of the Thirty-Second Conference on Learning Theory , pages =. 2019 , editor =

2019
[31]

Robust Regression via Hard Thresholding , url =

Bhatia, Kush and Jain, Prateek and Kar, Purushottam , booktitle =. Robust Regression via Hard Thresholding , url =
[32]

A Theoretical Review of Modern Robust Statistics

Loh, Po-Ling. A Theoretical Review of Modern Robust Statistics. Annual Review of Statistics and Its Application. 2025. doi:https://doi.org/10.1146/annurev-statistics-112723-034446

work page doi:10.1146/annurev-statistics-112723-034446 2025
[33]

Journal of the American Statistical Association , volume =

Ankit Pensia and Varun Jog and Po-Ling Loh , title =. Journal of the American Statistical Association , volume =. 2025 , publisher =. doi:10.1080/01621459.2024.2392906 , URL =

work page doi:10.1080/01621459.2024.2392906 2025
[34]

Davies , keywords =

Thomas Blumensath and Mike E. Davies , keywords =. Iterative hard thresholding for compressed sensing , journal =. 2009 , issn =. doi:https://doi.org/10.1016/j.acha.2009.04.002 , url =

work page doi:10.1016/j.acha.2009.04.002 2009
[35]

Journal of Machine Learning Research , year =

Xiao-Tong Yuan and Ping Li and Tong Zhang , title =. Journal of Machine Learning Research , year =
[36]

On Iterative Hard Thresholding Methods for High-dimensional M-Estimation , url =

Jain, Prateek and Tewari, Ambuj and Kar, Purushottam , booktitle =. On Iterative Hard Thresholding Methods for High-dimensional M-Estimation , url =
[37]

Journal of Machine Learning Research , year =

Jian Huang and Yuling Jiao and Yanyan Liu and Xiliang Lu , title =. Journal of Machine Learning Research , year =
[38]

Proceedings of the 30th International Conference on Algorithmic Learning Theory , pages =

Interplay of minimax estimation and minimax support recovery under sparsity , author =. Proceedings of the 30th International Conference on Algorithmic Learning Theory , pages =. 2019 , editor =

2019
[39]

arXiv preprint arXiv:2008.12236 , year=

Scaled minimax optimality in high-dimensional linear regression: A non-convex algorithmic regularization approach , author=. arXiv preprint arXiv:2008.12236 , year=

work page arXiv 2008
[40]

Nonconcave Penalized Likelihood With NP-Dimensionality , year=

Fan, Jianqing and Lv, Jinchi , journal=. Nonconcave Penalized Likelihood With NP-Dimensionality , year=
[41]

The Annals of Statistics , number =

Jianqing Fan and Lingzhou Xue and Hui Zou , title =. The Annals of Statistics , number =. 2014 , doi =

2014
[42]

The impact of contamination and correlated design on the Lasso: An average case analysis , journal =

Stanislav Minsker and Yiqiu Shen , keywords =. The impact of contamination and correlated design on the Lasso: An average case analysis , journal =. 2025 , issn =. doi:https://doi.org/10.1016/j.spl.2025.110417 , url =

work page doi:10.1016/j.spl.2025.110417 2025
[43]

Journal of Global Optimization , volume=

ORCA: Outlier detection and Robust Clustering for Attributed graphs , author=. Journal of Global Optimization , volume=. 2021 , publisher=. doi:10.1007/s10898-021-01024-z , URL =

work page doi:10.1007/s10898-021-01024-z 2021
[44]

The Annals of statistics , volume=

Bridging convex and nonconvex optimization in robust PCA: Noise, outliers, and missing data , author=. The Annals of statistics , volume=. 2021 , doi =

2021
[45]

2025 , month =

Ma, Mengru and Shang, Jiangwei , title =. 2025 , month =. doi:10.1088/1367-2630/adcfbf , url =

work page doi:10.1088/1367-2630/adcfbf 2025
[46]

Stepanova and Alexandre B

Cristina Butucea and Mohamed Ndaoud and Natalia A. Stepanova and Alexandre B. Tsybakov , title =. The Annals of Statistics , number =. 2018 , doi =

2018
[47]

The Annals of Statistics , number =

Bradley Efron and Trevor Hastie and Iain Johnstone and Robert Tibshirani , title =. The Annals of Statistics , number =. 2004 , doi =

2004
[48]

doi: 10.1080/01621459.2018.1482755

Qiang Sun and Wen-Xin Zhou and Jianqing Fan , title =. Journal of the American Statistical Association , volume =. 2020 , publisher =. doi:10.1080/01621459.2018.1543124 , URL =

work page doi:10.1080/01621459.2018.1543124 2020
[49]

Bernoulli , number =

Saptarshi Roy and Ambuj Tewari and Ziwei Zhu , title =. Bernoulli , number =. 2025 , doi =

2025
[50]

Biometrika , volume =

Zhao, Peng and Yang, Yun and He, Qiao-Chu , title =. Biometrika , volume =. 2022 , abstract =. doi:10.1093/biomet/asac010 , url =

work page doi:10.1093/biomet/asac010 2022
[51]

Journal of the Royal Statistical Society Series B: Statistical Methodology , volume =

Tan, Kean Ming and Wang, Lan and Zhou, Wen-Xin , title =. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume =. 2021 , issn =. doi:10.1111/rssb.12485 , url =

work page doi:10.1111/rssb.12485 2021
[52]

, Moran , Gemma E G

Yiyuan She and Zhifeng Wang and Jiahui Shen , title =. Journal of the American Statistical Association , volume =. 2022 , publisher =. doi:10.1080/01621459.2020.1850460 , URL =

work page doi:10.1080/01621459.2020.1850460 2022
[53]

arXiv preprint arXiv:1804.01230 , year=

The noise barrier and the large signal bias of the lasso and other convex estimators , author=. arXiv preprint arXiv:1804.01230 , year=

work page arXiv
[54]

Model Selection and Minimax Estimation in Generalized Linear Models , year=

Abramovich, Felix and Grinshtein, Vadim , journal=. Model Selection and Minimax Estimation in Generalized Linear Models , year=
[55]

2006 , publisher=

Theory of point estimation , author=. 2006 , publisher=

2006
[56]

and Yu, Bin , journal=

Raskutti, Garvesh and Wainwright, Martin J. and Yu, Bin , journal=. Minimax Rates of Estimation for High-Dimensional Linear Regression Over _q -Balls , year=
[57]

2018 , publisher=

Foundations of Machine Learning, second edition , author=. 2018 , publisher=

2018
[58]

Information-theoretic bounds on sparsity recovery in the high-dimensional and noisy setting , year=

Wainwright, Martin , booktitle=. Information-theoretic bounds on sparsity recovery in the high-dimensional and noisy setting , year=
[59]

, Title =

Tsybakov, Alexandre B. , Title =. 2009 , Publisher =

2009
[60]

Bondell and Yichao Wu , journal =

Dehan Kong and Howard D. Bondell and Yichao Wu , journal =. FULLY EFFICIENT ROBUST ESTIMATION, OUTLIER DETECTION AND VARIABLE SELECTION VIA PENALIZED REGRESSION , urldate =
[61]

Introduction to the non-asymptotic analysis of random matrices

Introduction to the non-asymptotic analysis of random matrices , author=. arXiv preprint arXiv:1011.3027 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[62]

arXiv preprint arXiv:2004.05990 , year=

Robust estimation with Lasso when outputs are adversarially contaminated , author=. arXiv preprint arXiv:2004.05990 , year=

work page arXiv 2004
[63]

Journal of Machine Learning Research , year =

Sasai, Takeyuki and Fujisawa, Hironori , title =. Journal of Machine Learning Research , year =
[64]

ICLR 2025 Workshop on Foundation Models in the Wild , year=

Detecting covariate shifts with vision-language foundation models , author=. ICLR 2025 Workshop on Foundation Models in the Wild , year=

2025
[65]

American Economic Review , Volume =

Goldsmith-Pinkham, Paul and Hull, Peter and Kolesár, Michal , Title =. American Economic Review , Volume =. 2024 , Pages =. doi:10.1257/aer.20221116 , URL =

work page doi:10.1257/aer.20221116 2024
[66]

Journal of Fourier analysis and Applications , volume=

Iterative thresholding for sparse approximations , author=. Journal of Fourier analysis and Applications , volume=. 2008 , publisher=

2008
[67]

Information and Inference: A Journal of the IMA , volume =

Liu, Haoyang and Foygel Barber, Rina , title =. Information and Inference: A Journal of the IMA , volume =. 2019 , issn =. doi:10.1093/imaiai/iaz027 , url =

work page doi:10.1093/imaiai/iaz027 2019
[68]

Slow Kill for Big Data Learning , year=

She, Yiyuan and Shen, Jiahui and Barbu, Adrian , journal=. Slow Kill for Big Data Learning , year=
[69]

The Annals of Statistics , number =

Zongming Ma , title =. The Annals of Statistics , number =
[70]

The Annals of Statistics , number =

Mohamed Ndaoud , title =. The Annals of Statistics , number =
[71]

Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection , volume =

Bai, Yu and Chen, Fan and Wang, Huan and Xiong, Caiming and Mei, Song , booktitle =. Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection , volume =
[72]

Journal of the American Statistical Association , volume =

Jianqing Fan and Zhuoran Yang and Mengxin Yu , title =. Journal of the American Statistical Association , volume =. 2023 , publisher =

2023
[73]

Lee and Xin T

Xi Chen and Jason D. Lee and Xin T. Tong and Yichen Zhang , title =. The Annals of Statistics , number =. 2020 , doi =

2020
[74]

The Annals of Statistics , number =

Kweku Abraham and Isma. The Annals of Statistics , number =. 2024 , doi =

2024
[75]

Tsybakov , title =

Karim Lounici and Massimiliano Pontil and Sara van de Geer and Alexandre B. Tsybakov , title =. The Annals of Statistics , number =. 2011 , doi =

2011
[76]

, journal=

Ndaoud, Mohamed and Tsybakov, Alexandre B. , journal=. Optimal Variable Selection and Adaptive Noisy Compressed Sensing , year=
[77]

The Annals of Statistics , number =

Jianqing Fan and Han Liu and Qiang Sun and Tong Zhang , title =. The Annals of Statistics , number =. 2018 , doi =

2018
[78]

Bellec and Guillaume Lecu

Pierre C. Bellec and Guillaume Lecu. The Annals of Statistics , number =. 2018 , doi =

2018
[79]

2019 , MONTH = Jul, KEYWORDS =

Ndaoud, Mohamed , URL =. 2019 , MONTH = Jul, KEYWORDS =

2019
[80]

Tsybakov , title =

Cristina Butucea and Enno Mammen and Mohamed Ndaoud and Alexandre B. Tsybakov , title =. The Annals of Statistics , number =. 2023 , doi =

2023

Showing first 80 references.

[1] [1]

Optimal Estimation of the Null Distribution in Large-Scale Inference , year=

Kotekal, Subhodh and Gao, Chao , journal=. Optimal Estimation of the Null Distribution in Large-Scale Inference , year=

[2] [2]

Outlier-robust estimation of a sparse linear model using _1 -penalized Huber s M-estimator , url =

Dalalyan, Arnak and Thompson, Philip , booktitle =. Outlier-robust estimation of a sparse linear model using _1 -penalized Huber s M-estimator , url =

[3] [3]

Electronic Journal of Statistics , number =

Geoffrey Chinot , title =. Electronic Journal of Statistics , number =. 2020 , doi =

2020

[4] [4]

SIAM Journal on Mathematics of Data Science , volume =

Minsker, Stanislav and Ndaoud, Mohamed and Wang, Lang , title =. SIAM Journal on Mathematics of Data Science , volume =. 2024 , doi =

2024

[5] [5]

The nonparametric Box–Cox model for high-dimensional regression analysis , journal =

He Zhou and Hui Zou , keywords =. The nonparametric Box–Cox model for high-dimensional regression analysis , journal =. 2024 , issn =. doi:https://doi.org/10.1016/j.jeconom.2023.01.025 , url =

work page doi:10.1016/j.jeconom.2023.01.025 2024

[6] [6]

2024 , eprint=

A Non-Parametric Box-Cox Approach to Robustifying High-Dimensional Linear Hypothesis Testing , author=. 2024 , eprint=

2024

[7] [7]

The Annals of Statistics , number =

Yinan Shen and Jingyang Li and Jian-Feng Cai and Dong Xia , title =. The Annals of Statistics , number =. 2025 , doi =

2025

[8] [8]

Bernoulli , number =

Chao Gao , title =. Bernoulli , number =. 2020 , doi =

2020

[9] [9]

Electronic Journal of Statistics , number =

Mengjie Chen and Chao Gao and Zhao Ren , title =. Electronic Journal of Statistics , number =. 2016 , doi =

2016

[10] [10]

arXiv preprint arXiv:2412.19183 , year=

Outlier-Bias Removal with Alpha Divergence: A Robust Non-Convex Estimator for Linear Regression , author=. arXiv preprint arXiv:2412.19183 , year=

work page arXiv

[11] [11]

Electronic Communications in Probability , number =

Daniel Hsu and Sham Kakade and Tong Zhang , title =. Electronic Communications in Probability , number =. 2012 , doi =

2012

[12] [12]

2019 , publisher=

High-dimensional statistics: A non-asymptotic viewpoint , author=. 2019 , publisher=

2019

[13] [13]

The Annals of Statistics , number =

Mengjie Chen and Chao Gao and Zhao Ren , title =. The Annals of Statistics , number =. 2018 , doi =

2018

[14] [14]

Huber , title =

Peter J. Huber , title =. The Annals of Mathematical Statistics , number =. 1964 , doi =

1964

[15] [15]

and Tseng, P

Sardy, S. and Tseng, P. and Bruce, A. , journal=. Robust wavelet denoising , year=. doi:10.1109/78.923297 , URL=

work page doi:10.1109/78.923297

[16] [16]

Tyrrell Rockafellar , title =

R. Tyrrell Rockafellar , title =. 1997 , pages =

1997

[17] [17]

Wiley Series in Probability and Mathematical Statistics , year=

Robust statistics , author=. Wiley Series in Probability and Mathematical Statistics , year=

[18] [18]

Statistics and Computing , volume=

Robust estimation and wavelet thresholding in partially linear models , author=. Statistics and Computing , volume=. 2007 , publisher=. doi:10.1007/s11222-007-9019-x , URL =

work page doi:10.1007/s11222-007-9019-x 2007

[19] [19]

2007 , doi =

Robust variable selection using least angle regression and elemental set sampling , journal =. 2007 , doi =

2007

[20] [20]

Owen , title =

Yiyuan She and Art B. Owen , title =. Journal of the American Statistical Association , volume =. 2011 , publisher =. doi:10.1198/jasa.2011.tm10390 , URL =

work page doi:10.1198/jasa.2011.tm10390 2011

[21] [21]

MacEachern and Yoonsuh Jung , title =

Yoonkyung Lee and Steven N. MacEachern and Yoonsuh Jung , title =. Statistical Science , number =. 2012 , doi =

2012

[22] [22]

arXiv preprint arXiv:2012.06750 , year=

Outlier-robust sparse/low-rank least-squares regression and robust matrix completion , author=. arXiv preprint arXiv:2012.06750 , year=

work page arXiv 2012

[23] [23]

and Tran, Trac D

Nguyen, Nam H. and Tran, Trac D. , journal=. Robust Lasso With Missing and Grossly Corrupted Observations , year=. doi:10.1109/TIT.2012.2232347 , URL=

work page doi:10.1109/tit.2012.2232347 2012

[24] [24]

Constructive Approximation , volume=

Compressed sensing and matrix completion with constant proportion of corruptions , author=. Constructive Approximation , volume=. 2013 , publisher=. doi:10.1007/s00365-012-9176-9 , URL=

work page doi:10.1007/s00365-012-9176-9 2013

[25] [25]

Corrupted Sensing: Novel Guarantees for Separating Structured Signals , year=

Foygel, Rina and Mackey, Lester , journal=. Corrupted Sensing: Novel Guarantees for Separating Structured Signals , year=. doi:10.1109/TIT.2013.2293654 , URL=

work page doi:10.1109/tit.2013.2293654 2013

[26] [26]

and Narayan, Akil , title =

Adcock, Ben and Bao, Anyi and Jakeman, John D. and Narayan, Akil , title =. SIAM/ASA Journal on Uncertainty Quantification , volume =. 2018 , doi =

2018

[27] [27]

The Annals of Statistics , number =

Guillaume Lecu. The Annals of Statistics , number =. 2020 , doi =

2020

[28] [28]

arXiv preprint arXiv:2103.10420 , year=

Robust-to-outliers square-root LASSO, simultaneous inference with a MOM approach , author=. arXiv preprint arXiv:2103.10420 , year=

work page arXiv

[29] [29]

Consistent Robust Regression , url =

Bhatia, Kush and Jain, Prateek and Kamalaruban, Parameswaran and Kar, Purushottam , booktitle =. Consistent Robust Regression , url =

[30] [30]

Proceedings of the Thirty-Second Conference on Learning Theory , pages =

Adaptive Hard Thresholding for Near-optimal Consistent Robust Regression , author =. Proceedings of the Thirty-Second Conference on Learning Theory , pages =. 2019 , editor =

2019

[31] [31]

Robust Regression via Hard Thresholding , url =

Bhatia, Kush and Jain, Prateek and Kar, Purushottam , booktitle =. Robust Regression via Hard Thresholding , url =

[32] [32]

A Theoretical Review of Modern Robust Statistics

Loh, Po-Ling. A Theoretical Review of Modern Robust Statistics. Annual Review of Statistics and Its Application. 2025. doi:https://doi.org/10.1146/annurev-statistics-112723-034446

work page doi:10.1146/annurev-statistics-112723-034446 2025

[33] [33]

Journal of the American Statistical Association , volume =

Ankit Pensia and Varun Jog and Po-Ling Loh , title =. Journal of the American Statistical Association , volume =. 2025 , publisher =. doi:10.1080/01621459.2024.2392906 , URL =

work page doi:10.1080/01621459.2024.2392906 2025

[34] [34]

Davies , keywords =

Thomas Blumensath and Mike E. Davies , keywords =. Iterative hard thresholding for compressed sensing , journal =. 2009 , issn =. doi:https://doi.org/10.1016/j.acha.2009.04.002 , url =

work page doi:10.1016/j.acha.2009.04.002 2009

[35] [35]

Journal of Machine Learning Research , year =

Xiao-Tong Yuan and Ping Li and Tong Zhang , title =. Journal of Machine Learning Research , year =

[36] [36]

On Iterative Hard Thresholding Methods for High-dimensional M-Estimation , url =

Jain, Prateek and Tewari, Ambuj and Kar, Purushottam , booktitle =. On Iterative Hard Thresholding Methods for High-dimensional M-Estimation , url =

[37] [37]

Journal of Machine Learning Research , year =

Jian Huang and Yuling Jiao and Yanyan Liu and Xiliang Lu , title =. Journal of Machine Learning Research , year =

[38] [38]

Proceedings of the 30th International Conference on Algorithmic Learning Theory , pages =

Interplay of minimax estimation and minimax support recovery under sparsity , author =. Proceedings of the 30th International Conference on Algorithmic Learning Theory , pages =. 2019 , editor =

2019

[39] [39]

arXiv preprint arXiv:2008.12236 , year=

Scaled minimax optimality in high-dimensional linear regression: A non-convex algorithmic regularization approach , author=. arXiv preprint arXiv:2008.12236 , year=

work page arXiv 2008

[40] [40]

Nonconcave Penalized Likelihood With NP-Dimensionality , year=

Fan, Jianqing and Lv, Jinchi , journal=. Nonconcave Penalized Likelihood With NP-Dimensionality , year=

[41] [41]

The Annals of Statistics , number =

Jianqing Fan and Lingzhou Xue and Hui Zou , title =. The Annals of Statistics , number =. 2014 , doi =

2014

[42] [42]

The impact of contamination and correlated design on the Lasso: An average case analysis , journal =

Stanislav Minsker and Yiqiu Shen , keywords =. The impact of contamination and correlated design on the Lasso: An average case analysis , journal =. 2025 , issn =. doi:https://doi.org/10.1016/j.spl.2025.110417 , url =

work page doi:10.1016/j.spl.2025.110417 2025

[43] [43]

Journal of Global Optimization , volume=

ORCA: Outlier detection and Robust Clustering for Attributed graphs , author=. Journal of Global Optimization , volume=. 2021 , publisher=. doi:10.1007/s10898-021-01024-z , URL =

work page doi:10.1007/s10898-021-01024-z 2021

[44] [44]

The Annals of statistics , volume=

Bridging convex and nonconvex optimization in robust PCA: Noise, outliers, and missing data , author=. The Annals of statistics , volume=. 2021 , doi =

2021

[45] [45]

2025 , month =

Ma, Mengru and Shang, Jiangwei , title =. 2025 , month =. doi:10.1088/1367-2630/adcfbf , url =

work page doi:10.1088/1367-2630/adcfbf 2025

[46] [46]

Stepanova and Alexandre B

Cristina Butucea and Mohamed Ndaoud and Natalia A. Stepanova and Alexandre B. Tsybakov , title =. The Annals of Statistics , number =. 2018 , doi =

2018

[47] [47]

The Annals of Statistics , number =

Bradley Efron and Trevor Hastie and Iain Johnstone and Robert Tibshirani , title =. The Annals of Statistics , number =. 2004 , doi =

2004

[48] [48]

doi: 10.1080/01621459.2018.1482755

Qiang Sun and Wen-Xin Zhou and Jianqing Fan , title =. Journal of the American Statistical Association , volume =. 2020 , publisher =. doi:10.1080/01621459.2018.1543124 , URL =

work page doi:10.1080/01621459.2018.1543124 2020

[49] [49]

Bernoulli , number =

Saptarshi Roy and Ambuj Tewari and Ziwei Zhu , title =. Bernoulli , number =. 2025 , doi =

2025

[50] [50]

Biometrika , volume =

Zhao, Peng and Yang, Yun and He, Qiao-Chu , title =. Biometrika , volume =. 2022 , abstract =. doi:10.1093/biomet/asac010 , url =

work page doi:10.1093/biomet/asac010 2022

[51] [51]

Journal of the Royal Statistical Society Series B: Statistical Methodology , volume =

Tan, Kean Ming and Wang, Lan and Zhou, Wen-Xin , title =. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume =. 2021 , issn =. doi:10.1111/rssb.12485 , url =

work page doi:10.1111/rssb.12485 2021

[52] [52]

, Moran , Gemma E G

Yiyuan She and Zhifeng Wang and Jiahui Shen , title =. Journal of the American Statistical Association , volume =. 2022 , publisher =. doi:10.1080/01621459.2020.1850460 , URL =

work page doi:10.1080/01621459.2020.1850460 2022

[53] [53]

arXiv preprint arXiv:1804.01230 , year=

The noise barrier and the large signal bias of the lasso and other convex estimators , author=. arXiv preprint arXiv:1804.01230 , year=

work page arXiv

[54] [54]

Model Selection and Minimax Estimation in Generalized Linear Models , year=

Abramovich, Felix and Grinshtein, Vadim , journal=. Model Selection and Minimax Estimation in Generalized Linear Models , year=

[55] [55]

2006 , publisher=

Theory of point estimation , author=. 2006 , publisher=

2006

[56] [56]

and Yu, Bin , journal=

Raskutti, Garvesh and Wainwright, Martin J. and Yu, Bin , journal=. Minimax Rates of Estimation for High-Dimensional Linear Regression Over _q -Balls , year=

[57] [57]

2018 , publisher=

Foundations of Machine Learning, second edition , author=. 2018 , publisher=

2018

[58] [58]

Information-theoretic bounds on sparsity recovery in the high-dimensional and noisy setting , year=

Wainwright, Martin , booktitle=. Information-theoretic bounds on sparsity recovery in the high-dimensional and noisy setting , year=

[59] [59]

, Title =

Tsybakov, Alexandre B. , Title =. 2009 , Publisher =

2009

[60] [60]

Bondell and Yichao Wu , journal =

Dehan Kong and Howard D. Bondell and Yichao Wu , journal =. FULLY EFFICIENT ROBUST ESTIMATION, OUTLIER DETECTION AND VARIABLE SELECTION VIA PENALIZED REGRESSION , urldate =

[61] [61]

Introduction to the non-asymptotic analysis of random matrices

Introduction to the non-asymptotic analysis of random matrices , author=. arXiv preprint arXiv:1011.3027 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[62] [62]

arXiv preprint arXiv:2004.05990 , year=

Robust estimation with Lasso when outputs are adversarially contaminated , author=. arXiv preprint arXiv:2004.05990 , year=

work page arXiv 2004

[63] [63]

Journal of Machine Learning Research , year =

Sasai, Takeyuki and Fujisawa, Hironori , title =. Journal of Machine Learning Research , year =

[64] [64]

ICLR 2025 Workshop on Foundation Models in the Wild , year=

Detecting covariate shifts with vision-language foundation models , author=. ICLR 2025 Workshop on Foundation Models in the Wild , year=

2025

[65] [65]

American Economic Review , Volume =

Goldsmith-Pinkham, Paul and Hull, Peter and Kolesár, Michal , Title =. American Economic Review , Volume =. 2024 , Pages =. doi:10.1257/aer.20221116 , URL =

work page doi:10.1257/aer.20221116 2024

[66] [66]

Journal of Fourier analysis and Applications , volume=

Iterative thresholding for sparse approximations , author=. Journal of Fourier analysis and Applications , volume=. 2008 , publisher=

2008

[67] [67]

Information and Inference: A Journal of the IMA , volume =

Liu, Haoyang and Foygel Barber, Rina , title =. Information and Inference: A Journal of the IMA , volume =. 2019 , issn =. doi:10.1093/imaiai/iaz027 , url =

work page doi:10.1093/imaiai/iaz027 2019

[68] [68]

Slow Kill for Big Data Learning , year=

She, Yiyuan and Shen, Jiahui and Barbu, Adrian , journal=. Slow Kill for Big Data Learning , year=

[69] [69]

The Annals of Statistics , number =

Zongming Ma , title =. The Annals of Statistics , number =

[70] [70]

The Annals of Statistics , number =

Mohamed Ndaoud , title =. The Annals of Statistics , number =

[71] [71]

Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection , volume =

Bai, Yu and Chen, Fan and Wang, Huan and Xiong, Caiming and Mei, Song , booktitle =. Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection , volume =

[72] [72]

Journal of the American Statistical Association , volume =

Jianqing Fan and Zhuoran Yang and Mengxin Yu , title =. Journal of the American Statistical Association , volume =. 2023 , publisher =

2023

[73] [73]

Lee and Xin T

Xi Chen and Jason D. Lee and Xin T. Tong and Yichen Zhang , title =. The Annals of Statistics , number =. 2020 , doi =

2020

[74] [74]

The Annals of Statistics , number =

Kweku Abraham and Isma. The Annals of Statistics , number =. 2024 , doi =

2024

[75] [75]

Tsybakov , title =

Karim Lounici and Massimiliano Pontil and Sara van de Geer and Alexandre B. Tsybakov , title =. The Annals of Statistics , number =. 2011 , doi =

2011

[76] [76]

, journal=

Ndaoud, Mohamed and Tsybakov, Alexandre B. , journal=. Optimal Variable Selection and Adaptive Noisy Compressed Sensing , year=

[77] [77]

The Annals of Statistics , number =

Jianqing Fan and Han Liu and Qiang Sun and Tong Zhang , title =. The Annals of Statistics , number =. 2018 , doi =

2018

[78] [78]

Bellec and Guillaume Lecu

Pierre C. Bellec and Guillaume Lecu. The Annals of Statistics , number =. 2018 , doi =

2018

[79] [79]

2019 , MONTH = Jul, KEYWORDS =

Ndaoud, Mohamed , URL =. 2019 , MONTH = Jul, KEYWORDS =

2019

[80] [80]

Tsybakov , title =

Cristina Butucea and Enno Mammen and Mohamed Ndaoud and Alexandre B. Tsybakov , title =. The Annals of Statistics , number =. 2023 , doi =

2023