On the existence of consistent adversarial attacks in high-dimensional linear classification

· 2025 · stat.ML · arXiv 2506.12454

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

What fundamentally distinguishes an adversarial attack from a misclassification due to limited model expressivity or finite data? In this work, we investigate this question in the setting of high-dimensional binary classification, where statistical effects due to limited data availability play a central role. We introduce a new error metric that precisely capture this distinction, quantifying model vulnerability to consistent adversarial attacks -- perturbations that preserve the ground-truth labels. Our main technical contribution is an exact and rigorous asymptotic characterization of these metrics in both well-specified models and latent space models, revealing different vulnerability patterns compared to standard robust error measures. The theoretical results demonstrate that as models become more overparameterized, their vulnerability to label-preserving perturbations grows, offering theoretical insight into the mechanisms underlying model sensitivity to adversarial attacks.

representative citing papers

Explaining Machine Learning and Memorization with Statistical Mechanics

cs.LG · 2026-06-30 · unverdicted · novelty 3.0

Thesis uses statistical mechanics to study DAM and RBM models for understanding memorization, low-dimensional learning, and adversarial robustness in neural networks.

citing papers explorer

Showing 1 of 1 citing paper.

Explaining Machine Learning and Memorization with Statistical Mechanics cs.LG · 2026-06-30 · unverdicted · none · ref 20 · internal anchor
Thesis uses statistical mechanics to study DAM and RBM models for understanding memorization, low-dimensional learning, and adversarial robustness in neural networks.

On the existence of consistent adversarial attacks in high-dimensional linear classification

fields

years

verdicts

representative citing papers

citing papers explorer