pith. sign in

arxiv: 1301.5035 · v1 · pith:5XYZ5CEYnew · submitted 2013-01-21 · 📊 stat.CO

Computing Robust Leverage Diagnostics when the Design Matrix Contains Coded Categorical Variables

classification 📊 stat.CO
keywords robustleveragevariablespredictorcategoricalcodedcomputedcontains
0
0 comments X
read the original abstract

For a robust leverage diagnostic in linear regression, Rousseeuw and van Zomeren [1990] proposed using robust distance (Mahalanobis distance computed using robust estimates of location and covariance). However, a design matrix X that contains coded categorical predictor variables is often sufficiently sparse that robust estimates of location and covariance cannot be computed. Specifically, matrices formed by taking subsets of the rows of X are likely to be singular, causing algorithms that rely on subsampling to fail. Following the spirit of Maronna and Yohai [2000], we observe that extreme leverage points are extreme in the continuous predictor variables. We therefore propose a robust leverage diagnostic that combines a robust analysis of the continuous predictor variables and the classical definition of leverage.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.