NeurIPS 2019; arXiv preprint matches proceedings

Alessio Ansuini, Alessandro Laio, Jakob H · 1905 · arXiv 1905.12784

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

UR-JEPA: Uniform Rectifiability as a Regularizer for Joint-Embedding Predictive Architectures

cs.LG · 2026-05-31 · unverdicted · novelty 7.0

UR-JEPA applies uniform rectifiability regularization via a smoothed Carleson square function to JEPA training, producing embeddings with 4-5 order PCA spectral drop at dimension 20-25 and lower seed variance than Gaussian regularization on Inet10, Galaxy10, and EuroSAT.

Patnaik-Pearson intrinsic dimension for internal representations of neural networks

math.ST · 2026-06-17 · unverdicted · novelty 6.0 · 2 refs

Introduces the Patnaik-Pearson intrinsic dimension estimator, proves some of its properties, relates it to HTSR/SETOL for Pareto spectra, and applies it to track embedding dimension evolution in BERT-base and DeepSeek-R1-Distill-Qwen-1.

The Geometry of Last-Layer Model Stealing

cs.LG · 2026-06-05 · unverdicted · novelty 3.0

Geometry maps the conditions for perfect last-layer theft in transformers and demonstrates that full hidden-network reverse engineering is impossible from final outputs.

The Long Delay to Arithmetic Generalization: When Learned Representations Outrun Behavior

cs.LG · 2026-03-30

citing papers explorer

Showing 4 of 4 citing papers.

UR-JEPA: Uniform Rectifiability as a Regularizer for Joint-Embedding Predictive Architectures cs.LG · 2026-05-31 · unverdicted · none · ref 22
UR-JEPA applies uniform rectifiability regularization via a smoothed Carleson square function to JEPA training, producing embeddings with 4-5 order PCA spectral drop at dimension 20-25 and lower seed variance than Gaussian regularization on Inet10, Galaxy10, and EuroSAT.
Patnaik-Pearson intrinsic dimension for internal representations of neural networks math.ST · 2026-06-17 · unverdicted · none · ref 1 · 2 links
Introduces the Patnaik-Pearson intrinsic dimension estimator, proves some of its properties, relates it to HTSR/SETOL for Pareto spectra, and applies it to track embedding dimension evolution in BERT-base and DeepSeek-R1-Distill-Qwen-1.
The Geometry of Last-Layer Model Stealing cs.LG · 2026-06-05 · unverdicted · none · ref 5
Geometry maps the conditions for perfect last-layer theft in transformers and demonstrates that full hidden-network reverse engineering is impossible from final outputs.
The Long Delay to Arithmetic Generalization: When Learned Representations Outrun Behavior cs.LG · 2026-03-30 · unreviewed · ref 2

NeurIPS 2019; arXiv preprint matches proceedings

fields

years

verdicts

representative citing papers

citing papers explorer