Significance analysis and statistical mechanics: an application to clustering

Johannes Berg; Marta Łuksza; Michael Lässig

arxiv: 1009.2470 · v1 · pith:QGHIFXEVnew · submitted 2010-09-13 · 🧬 q-bio.MN · cond-mat.stat-mech · q-bio.QM

classification 🧬 q-bio.MN · cond-mat.stat-mech · q-bio.QM

keywords: statistical, cluster, significance, vectors, application, clustering, data, mechanics

This paper addresses the statistical significance of structures in random data: Given a set of vectors and a measure of mutual similarity, how likely does a subset of these vectors form a cluster with enhanced similarity among its elements? The computation of this cluster p-value for randomly distributed vectors is mapped onto a well-defined problem of statistical mechanics. We solve this problem analytically, establishing a connection between the physics of quenched disorder and multiple testing statistics in clustering and related problems. In an application to gene expression data, we find a remarkable link between the statistical significance of a cluster and the functional relationships between its genes.
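To make the question in the abstract concrete, the following is a minimal brute-force Monte Carlo sketch of a cluster p-value. It is not the analytical statistical-mechanics solution the paper describes. The dot-product similarity, the independent Gaussian null model, the fixed cluster size `k`, and all function names are assumptions chosen for illustration. The sketch takes the maximum score over all subsets, so the estimate accounts for the multiple testing implied by searching for the best cluster.

```python
import itertools
import random


def pairwise_score(vectors, subset):
    """Sum of dot-product similarities over all pairs of indices in `subset`."""
    return sum(
        sum(a * b for a, b in zip(vectors[i], vectors[j]))
        for i, j in itertools.combinations(subset, 2)
    )


def best_cluster_score(vectors, k):
    """Highest pairwise score over all subsets of size k (brute force)."""
    return max(
        pairwise_score(vectors, subset)
        for subset in itertools.combinations(range(len(vectors)), k)
    )


def random_vectors(n, dim, rng):
    """Assumed null model: independent standard Gaussian components."""
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n)]


def cluster_p_value(observed_score, n, dim, k, trials=200, seed=0):
    """Estimate P(best size-k score in random data >= observed_score)."""
    rng = random.Random(seed)
    hits = sum(
        best_cluster_score(random_vectors(n, dim, rng), k) >= observed_score
        for _ in range(trials)
    )
    # Add-one correction keeps the Monte Carlo estimate strictly positive.
    return (hits + 1) / (trials + 1)
```

A call such as `cluster_p_value(observed, n=12, dim=8, k=4)` compares a hypothetical observed cluster score against the best clusters found in random data of the same shape. The number of subsets grows combinatorially with `n`, so this enumeration is only feasible for small data sets.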
