Learning curves for sgd on structured features.arXiv preprint arXiv:2106.02713, 2021

Blake Bordelon, Cengiz Pehlevan · 2021 · arXiv 2106.02713

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Homogenization of $\ell_2$-Adversarial Training in High-Dimensions: Exact Dynamics under Stochastic Gradient Descent

math.OC · 2026-06-30 · unverdicted · novelty 7.0

Derives ODE deterministic equivalents and an adversarial homogenized SDE for SGD iterates in high-dim ℓ2-adversarial training, showing no constant learning rate ensures monotone descent for single-class adversarial least squares and equivalence to adaptive regularized standard SGD.

Neural Scaling Universality: If Exponents Are Fixed, Time to Understand Coefficients

cs.LG · 2026-06-23 · unverdicted · novelty 4.0

Position paper claims fixed exponents in scaling laws arise from generic mechanisms while coefficients vary with data and architecture, making the latter the focus for improvements.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Homogenization of $\ell_2$-Adversarial Training in High-Dimensions: Exact Dynamics under Stochastic Gradient Descent math.OC · 2026-06-30 · unverdicted · none · ref 12
Derives ODE deterministic equivalents and an adversarial homogenized SDE for SGD iterates in high-dim ℓ2-adversarial training, showing no constant learning rate ensures monotone descent for single-class adversarial least squares and equivalence to adaptive regularized standard SGD.
Neural Scaling Universality: If Exponents Are Fixed, Time to Understand Coefficients cs.LG · 2026-06-23 · unverdicted · none · ref 33
Position paper claims fixed exponents in scaling laws arise from generic mechanisms while coefficients vary with data and architecture, making the latter the focus for improvements.

Learning curves for sgd on structured features.arXiv preprint arXiv:2106.02713, 2021

fields

years

verdicts

representative citing papers

citing papers explorer