pith. sign in

arxiv: 1002.4547 · v1 · submitted 2010-02-24 · 🧮 math.ST · stat.TH

A two-sample test for high-dimensional data with applications to gene-set testing

classification 🧮 math.ST stat.TH
keywords datatesthigh-dimensionaldimensionmuchproposedsamplesize
0
0 comments X
read the original abstract

We propose a two-sample test for the means of high-dimensional data when the data dimension is much larger than the sample size. Hotelling's classical $T^2$ test does not work for this "large $p$, small $n$" situation. The proposed test does not require explicit conditions in the relationship between the data dimension and sample size. This offers much flexibility in analyzing high-dimensional data. An application of the proposed test is in testing significance for sets of genes which we demonstrate in an empirical study on a leukemia data set.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.