pith. machine review for the scientific record. sign in

arxiv: 1708.00502 · v3 · submitted 2017-08-01 · 🧮 math.ST · stat.TH

Recognition: unknown

Estimation of the covariance structure of heavy-tailed distributions

Authors on Pith no claims yet
classification 🧮 math.ST stat.TH
keywords covariancedistributionsmatrixcasedatadimensionestimationestimator
0
0 comments X
read the original abstract

We propose and analyze a new estimator of the covariance matrix that admits strong theoretical guarantees under weak assumptions on the underlying distribution, such as existence of moments of only low order. While estimation of covariance matrices corresponding to sub-Gaussian distributions is well-understood, much less in known in the case of heavy-tailed data. As K. Balasubramanian and M. Yuan write, "data from real-world experiments oftentimes tend to be corrupted with outliers and/or exhibit heavy tails. In such cases, it is not clear that those covariance matrix estimators .. remain optimal" and "..what are the other possible strategies to deal with heavy tailed distributions warrant further studies." We make a step towards answering this question and prove tight deviation inequalities for the proposed estimator that depend only on the parameters controlling the "intrinsic dimension" associated to the covariance matrix (as opposed to the dimension of the ambient space); in particular, our results are applicable in the case of high-dimensional observations.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Detection Is Harder Than Estimation in Certain Regimes: Inference for Moment and Cumulant Tensors

    math.ST 2026-03 accept novelty 8.0

    The minimax rate for estimating d-th order moment tensors is sqrt(p/n) wedge 1, while low-degree evidence shows detection of vanishing cumulants is hard for n much less than p to the d/2, creating a reverse detection-...