Discovering Structure in High-Dimensional Data Through Correlation Explanation

Aram Galstyan; Greg Ver Steeg

arxiv: 1406.1222 · v2 · pith:Z4Q2BFLGnew · submitted 2014-06-04 · cs.LG · cs.AI · stat.ML

keywords: data, correlation, explanation, method, structure, abstract, approach, assumptions

We introduce a method to learn a hierarchy of successively more abstract representations of complex data based on optimizing an information-theoretic objective. Intuitively, the optimization searches for a set of latent factors that best explain the correlations in the data as measured by multivariate mutual information. The method is unsupervised, requires no model assumptions, and scales linearly with the number of variables, which makes it an attractive approach for very high-dimensional systems. We demonstrate that Correlation Explanation (CorEx) automatically discovers meaningful structure for data from diverse sources including personality tests, DNA, and human language.
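
To make the idea of a latent factor "explaining" correlations concrete, here is a minimal sketch. It assumes that the multivariate mutual information mentioned in the abstract can be read as total correlation, TC(X) = sum_i H(X_i) - H(X), and that a factor Y explains the amount TC(X) - TC(X | Y). The function names and the toy data are illustrative assumptions, the estimates are simple plug-in counts, and this is not the CorEx optimization itself, which searches for the factors rather than being handed them.

```python
from collections import Counter
from math import log2


def entropy(samples):
    """Plug-in Shannon entropy (in bits) of a list of hashable outcomes."""
    n = len(samples)
    counts = Counter(samples)
    return -sum((c / n) * log2(c / n) for c in counts.values())


def total_correlation(rows):
    """TC(X) = sum_i H(X_i) - H(X_1, ..., X_n), estimated from sample rows."""
    rows = [tuple(r) for r in rows]
    columns = list(zip(*rows))
    return sum(entropy(list(col)) for col in columns) - entropy(rows)


def conditional_total_correlation(rows, y):
    """TC(X | Y) = sum_y p(y) * TC(X | Y = y), estimated from samples."""
    n = len(rows)
    groups = {}
    for row, label in zip(rows, y):
        groups.setdefault(label, []).append(row)
    return sum(len(g) / n * total_correlation(g) for g in groups.values())


# Toy data (an assumption for illustration): three observed binary
# variables that are noiseless copies of one hidden bit.
hidden = [0, 1, 0, 1, 0, 1, 0, 1]
data = [(h, h, h) for h in hidden]

tc = total_correlation(data)                            # dependence among the X_i
tc_given = conditional_total_correlation(data, hidden)  # what remains given Y
explained = tc - tc_given                               # correlation explained by Y

# Two independent fair bits share no information, so their TC is zero.
independent = [(0, 0), (0, 1), (1, 0), (1, 1)]
tc_independent = total_correlation(independent)
```

In the toy data the hidden bit accounts for all of the dependence among the three copies, so conditioning on it leaves no residual total correlation. With real data the latent factor is unknown and plug-in entropy estimates become unreliable as the number of variables grows, which is why a practical method needs a tractable objective rather than this direct computation.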
