
arxiv: 1307.1827 · v7 · pith:I66L3LJPnew · submitted 2013-07-07 · 💻 cs.LG · stat.ML

Loss minimization and parameter estimation with heavy tails

classification 💻 cs.LG stat.ML
keywords estimation, technique, applications, bounded, delta, distributions, estimator, least

This work studies applications and generalizations of a simple estimation technique that provides exponential concentration under heavy-tailed distributions, assuming only bounded low-order moments. We show that the technique can be used for approximate minimization of smooth and strongly convex losses, and specifically for least squares linear regression. For instance, our $d$-dimensional estimator requires just $\tilde{O}(d\log(1/\delta))$ random samples to obtain a constant factor approximation to the optimal least squares loss with probability $1-\delta$, without requiring the covariates or noise to be bounded or subgaussian. We provide further applications to sparse linear regression and low-rank covariance matrix estimation with similar allowances on the noise and covariate distributions. The core technique is a generalization of the median-of-means estimator to arbitrary metric spaces.
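As a rough illustration of the core technique named in the abstract, the following Python sketch shows a scalar median-of-means estimator and one possible way to extend it to points in a metric space. The function names `median_of_means` and `metric_median_of_means`, the interleaved grouping, and the selection rule (pick the group mean whose median distance to the other group means is smallest) are assumptions made for this example; the abstract does not specify the paper's exact construction.

```python
import math
import statistics


def median_of_means(samples, k):
    """Split samples into k groups, average each group, return the median of the averages."""
    if not 1 <= k <= len(samples):
        raise ValueError("k must be between 1 and len(samples)")
    # Interleaved split: group i takes samples i, i+k, i+2k, ...
    groups = [samples[i::k] for i in range(k)]
    return statistics.median(statistics.fmean(g) for g in groups)


def metric_median_of_means(points, k, dist=math.dist):
    """Assumed generalization: choose the group mean closest (in median distance) to the others.

    points is a list of equal-length tuples of floats; dist is any metric on them.
    """
    if not 1 <= k <= len(points):
        raise ValueError("k must be between 1 and len(points)")
    groups = [points[i::k] for i in range(k)]
    dim = len(points[0])
    # Coordinate-wise mean of each group.
    means = [
        tuple(statistics.fmean(p[j] for p in g) for j in range(dim))
        for g in groups
    ]

    def median_radius(candidate):
        # Median distance from this candidate to all group means (itself included).
        return statistics.median(dist(candidate, other) for other in means)

    return min(means, key=median_radius)


# One extreme outlier corrupts a single group; the median ignores that group.
scalar_data = [1.0] * 9 + [1000.0]
print(median_of_means(scalar_data, 5))  # → 1.0

vector_data = [(0.0, 0.0)] * 9 + [(1000.0, 1000.0)]
print(metric_median_of_means(vector_data, 5))  # → (0.0, 0.0)
```

In both calls the plain sample mean would be pulled far from the bulk of the data by the single outlier, while the median step discards the one contaminated group. The number of groups `k` trades off robustness against the accuracy of each group mean; in analyses of this kind it is typically chosen on the order of $\log(1/\delta)$.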
