Recognition: unknown
Implicit Regularization in Matrix Factorization
classification
📊 stat.ML
cs.LG
keywords
factorizationdescentenoughgradientimplicitmatrixregularizationclose
read the original abstract
We study implicit regularization when optimizing an underdetermined quadratic objective over a matrix $X$ with gradient descent on a factorization of $X$. We conjecture and provide empirical and theoretical evidence that with small enough step sizes and initialization close enough to the origin, gradient descent on a full dimensional factorization converges to the minimum nuclear norm solution.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Estimating Implicit Regularization in Deep Learning
Gradient matching empirically recovers implicit regularization effects such as l2 penalties from early stopping and dropout in neural networks.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.