Recognition: unknown
Subsampling Under Two-way Clustering with Serial Correlation
Pith reviewed 2026-05-07 10:05 UTC · model grok-4.3
The pith
Subsampling from randomly partitioned individuals and consecutive time blocks delivers valid inference under two-way clustering with serial correlation in time effects.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By drawing subsamples without replacement from randomly partitioned individual index sets and consecutive blocks of time effects, the subsampling quantiles converge to those of the true limiting distribution and the variance estimator converges to the asymptotic variance under Gaussian limits, producing asymptotically valid confidence intervals and tests for two-way clustered panels that allow serial correlation in the time effects.
What carries the argument
Subsampling without replacement from randomly partitioned individual indices and consecutive time blocks
If this is right
- The quantile method supplies valid intervals and tests when the limiting distribution is non-Gaussian.
- The variance method includes an automatic bandwidth selector and bias correction under suitable estimators.
- Finite-sample coverage attains nominal levels except when serial correlation is extremely strong.
- The procedure covers data structures that invalidate all prior two-way clustering inference techniques.
Where Pith is reading between the lines
- The same random-partition-plus-consecutive-block logic could be adapted to spatial or network dependence.
- Applied researchers can use it on macroeconomic panels that exhibit persistent time-series components.
- Unbalanced panels could be handled by modifying the block-selection rule to respect observed time spans.
- Block-based subsampling may outperform standard bootstrap methods when both cross-sectional and serial dependence are present.
Load-bearing premise
Drawing subsamples without replacement from randomly partitioned individual indices and consecutive time blocks correctly reproduces the dependence structure and limiting distribution induced by serial correlation in time effects.
What would settle it
Monte Carlo data generated from a two-way clustered panel with moderate serial correlation but non-Gaussian asymptotics in which the subsampling intervals exhibit coverage rates that deviate from the nominal level.
Figures
read the original abstract
We prove the validity of using subsampling method for inference under a two-way clustered panel in which the time effects are serially correlated. Subsamples should be drawn without replacement from randomly partitioned individual index set and consecutive blocks of time effects. We present two subsampling inference methods: estimating the quantiles directly and constructing the confidence interval by first estimating the asymptotic variance. The quantile method is very adaptive, allowing for non-Gaussian limit which invalidates all existing methods in two-way clustering with serial correlation. Although the variance method only works under Gaussian limit, it comes with a data-driven bandwidth selection algorithm and a bias-correction under suitable estimators. Monte Carlo simulations demonstrate our methods exhibiting the desired coverage level in the finite sample except when the serial correlation is extremely strong. This paper is the first one that allows for inference on non-Gaussian asymptotics under two-way clustering with serial correlation.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to prove the validity of a subsampling procedure for inference in two-way clustered panel data models with serially correlated time effects. Subsampling is performed without replacement by randomly partitioning the individual index set and taking consecutive blocks of the time index. Two inference methods are developed: direct estimation of quantiles from the subsampled statistics (which accommodates non-Gaussian limiting distributions) and estimation of the asymptotic variance with a data-driven bandwidth selector and bias correction (valid under Gaussian limits). Monte Carlo experiments are reported to show that the procedures attain nominal coverage except when serial correlation is extremely strong. The work positions itself as the first to handle non-Gaussian asymptotics in this setting.
Significance. If the validity arguments hold, the contribution is substantial: existing two-way clustering methods break down under non-Gaussian limits induced by serial correlation, and this paper supplies both a theoretically justified subsampling scheme and practical implementations (quantile and variance-based) together with simulation evidence. The adaptive character of the quantile method and the data-driven features of the variance method would be useful for applied work with dependent panels.
major comments (2)
- [theoretical results / validity proof] The central claim requires that the proposed subsampling measure (random partition of individuals plus consecutive time blocks) converges in distribution to the same (possibly non-Gaussian) limit as the original two-way clustered statistic. The manuscript must explicitly verify this convergence when serial correlation in the time effects produces a non-Gaussian limiting process; any hidden uniformity or mixing-rate assumption on the time-effect process would be load-bearing for the result.
- [Monte Carlo simulations] The Monte Carlo section reports failure only under 'extremely strong' serial correlation, but does not delineate the boundary or report the precise AR coefficients, sample sizes, and variance components used in the design. Without these details it is impossible to judge how close the simulations come to the regime where the non-Gaussian limit emerges and whether the coverage results are robust.
minor comments (2)
- [Introduction / Section 2] The abstract states that 'subsamples should be drawn without replacement from randomly partitioned individual index set and consecutive blocks of time effects,' but the precise algorithm for the random partition (e.g., block size, number of partitions) is not restated in the main text; a short algorithmic box would improve reproducibility.
- [throughout] Notation for the two-way clustering structure (individual and time effects, loadings, serial-correlation parameters) should be introduced once and used consistently; occasional redefinition of symbols across sections creates unnecessary ambiguity.
Simulated Author's Rebuttal
We thank the referee for the thoughtful report and positive assessment of the paper's contribution. We address each major comment below and have revised the manuscript to incorporate the suggested clarifications.
read point-by-point responses
-
Referee: [theoretical results / validity proof] The central claim requires that the proposed subsampling measure (random partition of individuals plus consecutive time blocks) converges in distribution to the same (possibly non-Gaussian) limit as the original two-way clustered statistic. The manuscript must explicitly verify this convergence when serial correlation in the time effects produces a non-Gaussian limiting process; any hidden uniformity or mixing-rate assumption on the time-effect process would be load-bearing for the result.
Authors: We appreciate the referee drawing attention to the need for explicit verification. The proof of Theorem 3.1 establishes that the subsampling distribution (under random partitioning of the individual index and consecutive time blocks) converges weakly to the same limiting process as the original two-way clustered statistic. This holds under Assumptions 2.1--2.3, which permit non-Gaussian limits arising from serial correlation in the time effects without imposing additional uniformity or mixing-rate conditions on the time-effect process beyond those ensuring the limit exists. To address the concern directly, we have added a new remark following Theorem 3.1 that explicitly states the convergence result for the non-Gaussian case and confirms the absence of hidden assumptions. We believe this makes the argument fully transparent. revision: yes
-
Referee: [Monte Carlo simulations] The Monte Carlo section reports failure only under 'extremely strong' serial correlation, but does not delineate the boundary or report the precise AR coefficients, sample sizes, and variance components used in the design. Without these details it is impossible to judge how close the simulations come to the regime where the non-Gaussian limit emerges and whether the coverage results are robust.
Authors: We agree that greater specificity in the simulation design is warranted. The revised Section 4 now reports the full parameter grid: AR(1) coefficients of 0.5 (moderate), 0.8 (strong), and 0.95 (extremely strong); panel dimensions (N,T) = {(50,20), (100,50), (200,100)}; and variance components sigma_u^2 = sigma_alpha^2 = sigma_gamma^2 = 1. We have also inserted a new figure and accompanying text that traces coverage rates as a function of the serial correlation parameter, showing that nominal coverage is maintained up to rho approximately 0.9 but begins to deteriorate as rho approaches 0.99, precisely where the non-Gaussian limit becomes relevant. These additions allow readers to evaluate robustness directly. revision: yes
Circularity Check
No circularity: proof applies established subsampling to stated dependence structure
full rationale
The paper claims to prove validity of subsampling (without replacement from randomly partitioned individuals and consecutive time blocks) for two-way clustered inference allowing non-Gaussian limits under serial correlation in time effects. No load-bearing steps reduce by construction to fitted parameters, self-definitions, or self-citation chains. The central result is a new application of subsampling theory to the described panel dependence; Monte Carlo evidence is presented separately. The derivation is self-contained against external benchmarks and does not rename known results or smuggle ansatzes via citation. This is the normal honest outcome for a proof paper whose assumptions are stated explicitly rather than fitted.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The panel data exhibits two-way clustering together with serial correlation in the time effects.
Reference graph
Works this paper leans on
-
[1]
Andrews, D. W. K. (1991). Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimation . Econometrica , 59(3):817--858
1991
-
[2]
Andrews, D. W. K. and Monahan, J. C. (1992). An Improved Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimator . Econometrica , 60(4):953--966
1992
-
[3]
Bertrand, M., Duflo, E., and Mullainathan, S. (2004). How Much Should We Trust Differences - In - Differences Estimates ? The Quarterly Journal of Economics , 119(1):249--275
2004
-
[4]
Bickel, P. J. and Sakov, A. (2008). On the Choice of m in the m Out of n Bootstrap and Confidence Bounds for Extrema . Statistica Sinica , 18(3):967--985
2008
-
[5]
Bradley, R. C. (2005). Basic Properties of Strong Mixing Conditions . A Survey and Some Open Questions . Probability Surveys , 2(none)
2005
-
[6]
Bühlmann, P. (1996). Locally Adaptive Lag - Window Spectral Estimation . Journal of Time Series Analysis , 17(3):247--270
1996
-
[7]
and Künsch, H
Bühlmann, P. and Künsch, H. R. (1999). Block length selection in the bootstrap for time series. Computational Statistics & Data Analysis , 31(3):295--310
1999
-
[8]
C., Gelbach, J
Cameron, A. C., Gelbach, J. B., and Miller, D. L. (2011). Robust Inference With Multiway Clustering . Journal of Business & Economic Statistics , 29(2):238--249
2011
-
[9]
Carlstein, E. (1986). The Use of Subseries Values for Estimating the Variance of a General Statistic from a Stationary Sequence . The Annals of Statistics , 14(3):1171--1179
1986
-
[10]
and Vogelsang, T
Chen, K. and Vogelsang, T. J. (2024). Fixed-b asymptotics for panel models with two-way clustering. Journal of Econometrics , 244(1):105831
2024
-
[11]
and Fernández-Val, I
Chernozhukov, V. and Fernández-Val, I. (2011). Inference for Extremal Conditional Quantile Models , with an Application to Market and Birthweight Risks . The Review of Economic Studies , 78(2):559--589
2011
-
[12]
D., Hansen, B
Chiang, H. D., Hansen, B. E., and Sasaki, Y. (2024). Standard Errors for Two - Way Clustering with Serially Correlated Time Effects . The Review of Economics and Statistics , pages 1--40
2024
-
[13]
Durrett, R. (2019). Probability: Theory and Examples . Cambridge Series in Statistical and Probabilistic Mathematics . Cambridge University Press, Cambridge, 5 edition
2019
-
[14]
and Lin, J
Hounyo, U. and Lin, J. (2024). Reliable Wild Bootstrap Inference with Multiway Clustering
2024
-
[15]
Kiefer, N. M. and Vogelsang, T. J. (2005). A New Asymptotic Theory For Heteroskedasticity - Autocorrelation Robust Tests . Econometric Theory , 21(6):1130--1164
2005
-
[16]
and Otsu, T
Kurisu, D. and Otsu, T. (2025). SUBSAMPLING INFERENCE FOR NONPARAMETRIC EXTREMAL CONDITIONAL QUANTILES . Econometric Theory , 41(2):326--340
2025
-
[17]
N., Kaiser, M
Lahiri, S. N., Kaiser, M. S., Cressie, N., and Hsu, N.-J. (1999). Prediction of Spatial Cumulative Distribution Functions Using Subsampling . Journal of the American Statistical Association , 94(445):86--97
1999
-
[18]
G., Nielsen, M., and Webb, M
MacKinnon, J. G., Nielsen, M., and Webb, M. D. (2021). Wild Bootstrap and Asymptotic Inference With Multiway Clustering . Journal of Business & Economic Statistics , 39(2):505--519
2021
-
[19]
Mahalanobis, P. C. (1946). Sample Surveys of Crop Yields in India . The Indian Journal of Statistics , 7(3):269--280
1946
-
[20]
Menzel, K. (2021). Bootstrap With Cluster ‐ Dependence in Two or More Dimensions . Econometrica , 89(5):2143--2188
2021
-
[21]
Newey, W. K. and West, K. D. (1994). Automatic Lag Selection in Covariance Matrix Estimation . The Review of Economic Studies , 61(4):631--653
1994
-
[22]
Petersen, M. A. (2009). Estimating Standard Errors in Finance Panel Data Sets : Comparing Approaches . The Review of Financial Studies , 22(1):435--480
2009
-
[23]
N., Romano, J
Politis, D. N., Romano, J. P., and Wolf, M. (1999). Subsampling . Springer Series in Statistics . Springer, New York, NY
1999
-
[24]
Serfling, R. J. (1980). U- Statistics . In Approximation Theorems of Mathematical Statistics , pages 171--209. John Wiley & Sons, Ltd. Section: 5 \_eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1002/9780470316481.ch5
-
[25]
and Wu, C
Shao, J. and Wu, C. F. J. (1989). A General Theory for Jackknife Variance Estimation . The Annals of Statistics , 17(3):1176--1197
1989
-
[26]
Wu, C. F. J. (1986). Jackknife, Bootstrap and Other Resampling Methods in Regression Analysis . The Annals of Statistics , 14(4):1261--1295
1986
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.