Beyond Least-Squares: Fast Rates for Regularized Empirical Risk Minimization through Self-Concordance

Alessandro Rudi (PSL); Dmitrii Ostrovskii (PSL); Francis Bach (PSL; SIERRA); Ulysse Marteau-Ferey (PSL)

arXiv: 1902.03046 · v3 · pith:GGO6KTDXnew · submitted 2019-02-08 · cs.LG · cs.AI · math.ST · stat.TH

Ulysse Marteau-Ferey (PSL, SIERRA), Dmitrii Ostrovskii (PSL), Francis Bach (PSL), Alessandro Rudi (PSL)

classification: cs.LG, cs.AI, math.ST, stat.TH

keywords: least-squares, rates, risk, beyond, convergence, derivatives, empirical, fast


We consider learning methods based on the regularization of a convex empirical risk by a squared Hilbertian norm, a setting that includes linear predictors and non-linear predictors through positive-definite kernels. In order to go beyond the generic analysis leading to convergence rates of the excess risk as $O(1/\sqrt{n})$ from $n$ observations, we assume that the individual losses are self-concordant, that is, their third-order derivatives are bounded by their second-order derivatives. This setting includes least-squares, as well as all generalized linear models such as logistic and softmax regression. For this class of losses, we provide a bias-variance decomposition and show that the assumptions commonly made in least-squares regression, such as the source and capacity conditions, can be adapted to obtain fast non-asymptotic rates of convergence by improving the bias terms, the variance terms or both.
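To make the self-concordance assumption concrete, the following minimal sketch checks numerically that the scalar logistic loss $\varphi(t) = \log(1 + e^{-t})$ has a third derivative bounded in absolute value by its second derivative. The specific inequality $|\varphi'''(t)| \le \varphi''(t)$ is an assumption here: it is one common scalar form of the condition, and the abstract does not state the exact inequality or constants used in the paper. All function names are hypothetical and chosen for illustration only.

```python
import math


def sigmoid(t: float) -> float:
    """Logistic sigmoid 1 / (1 + exp(-t))."""
    return 1.0 / (1.0 + math.exp(-t))


def logistic_second_derivative(t: float) -> float:
    """Second derivative of phi(t) = log(1 + exp(-t)): s * (1 - s)."""
    s = sigmoid(t)
    return s * (1.0 - s)


def logistic_third_derivative(t: float) -> float:
    """Third derivative of phi: s * (1 - s) * (1 - 2 * s)."""
    s = sigmoid(t)
    return s * (1.0 - s) * (1.0 - 2.0 * s)


def max_self_concordance_ratio(lo: float = -20.0, hi: float = 20.0,
                               steps: int = 4001) -> float:
    """Largest value of |phi'''(t)| / phi''(t) on a uniform grid over [lo, hi]."""
    width = (hi - lo) / (steps - 1)
    grid = (lo + i * width for i in range(steps))
    return max(
        abs(logistic_third_derivative(t)) / logistic_second_derivative(t)
        for t in grid
    )


if __name__ == "__main__":
    # The ratio equals |1 - 2 * sigmoid(t)|, which stays strictly below 1.
    print(max_self_concordance_ratio())
```

The ratio simplifies analytically to $|1 - 2\sigma(t)|$, so the grid check only illustrates the bound and does not prove it. For the squared loss the third derivative vanishes, so the same inequality holds trivially.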
