Derives geodesic ridge regularization and Riemannian Gibbs Process prior for feature-learning wide neural networks, generalizing kernel-regime results via function-space axiomatization.
Smooth manifolds
4 Pith papers cite this work. Polarity classification is still indexing.
representative citing papers
The Lebesgue measure of ε-forging sets decays as O(ε) or ε^d for linear models and as ε^{(d-r)/2} under mild regularity assumptions, with vanishing probability of random sampling.
Manifold constraints via the new MACRO optimizer independently bound activation scales and enforce rotational equilibrium in LLM pre-training, subsuming RMS normalization and decoupled weight decay while delivering competitive performance with convergence guarantees.
citing papers explorer
-
Canonical Regularisation of Wide Feature-Learning Neural Networks
Derives geodesic ridge regularization and Riemannian Gibbs Process prior for feature-learning wide neural networks, generalizing kernel-regime results via function-space axiomatization.
-
The Measure of Deception: An Analysis of Data Forging in Machine Unlearning
The Lebesgue measure of ε-forging sets decays as O(ε) or ε^d for linear models and as ε^{(d-r)/2} under mild regularity assumptions, with vanishing probability of random sampling.
-
Demystifying Manifold Constraints in LLM Pre-training
Manifold constraints via the new MACRO optimizer independently bound activation scales and enforce rotational equilibrium in LLM pre-training, subsuming RMS normalization and decoupled weight decay while delivering competitive performance with convergence guarantees.
- Gauge Symmetries, Contact Reduction, and Singular Field Theories