arxiv: 1604.00447 · v3 · pith:X4NBS5EMnew · submitted 2016-04-02 · 📊 stat.ME

Ordering-Free Inference from Locally Dependent Data

classification 📊 stat.ME
keywords inference · randomized · data · statistic · statistics · dependence · subsampling · test

This paper focuses on a data-rich environment where the data set has a very large cross-sectional dimension and is likely to exhibit local dependence, yet its dependence ordering is hard to determine. Such a situation arises, for example, when the data set is collected from the Internet through web crawling. This paper proposes an approach of randomized subsampling inference, where one constructs a test statistic by aggregating many randomized test statistics computed from random draws of subsamples, and uses for inference the conditional distribution of the test statistic given the data. This paper explores two approaches to such inference: one based on an M-type statistic constructed from randomized mean statistics, and the other based on a U-type statistic constructed from randomized U-statistics. This paper provides conditions on the local dependence, the number of random draws, and the subsample size under which randomized subsampling inference is asymptotically valid. In Monte Carlo simulation studies, this paper finds that randomized subsampling inference based on the U-type statistic performs better than that based on the M-type statistic.
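
The aggregation idea in the abstract can be illustrated with a minimal sketch. It is not the paper's M-type or U-type statistic, and it does not reproduce the paper's critical values or validity conditions. The function names `randomized_subsample_means` and `aggregate_squared`, the parameters `b` (subsample size) and `R` (number of random draws), the sqrt(b) scaling, and the choice to aggregate by averaging squared statistics are all assumptions made for illustration.

```python
import numpy as np


def randomized_subsample_means(x, b, R, rng):
    """Return R scaled subsample means, each from a random draw of size b.

    Hypothetical helper: draws indices without replacement, so no
    knowledge of the dependence ordering of x is used.
    """
    x = np.asarray(x, dtype=float)
    n = x.shape[0]
    if not 1 <= b <= n:
        raise ValueError("subsample size b must satisfy 1 <= b <= n")
    stats = np.empty(R)
    for r in range(R):
        idx = rng.choice(n, size=b, replace=False)  # one random subsample
        stats[r] = np.sqrt(b) * x[idx].mean()       # scaled mean statistic
    return stats


def aggregate_squared(stats):
    """Aggregate the randomized statistics by averaging their squares.

    Assumed aggregation rule for illustration only.
    """
    return float(np.mean(np.asarray(stats) ** 2))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    data = rng.standard_normal(10_000)  # placeholder data, mean zero
    stats = randomized_subsample_means(data, b=50, R=200, rng=rng)
    print(aggregate_squared(stats))
```

Because the subsamples are drawn at random, the sketch never needs to know which observations are neighbors of which, which is the sense in which such a procedure is ordering-free.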
