Feature Screening for High-Dimensional Structural Break Predictive Regression
Pith reviewed 2026-06-26 20:09 UTC · model grok-4.3
The pith
A screening procedure selects sparse active predictors and change points in high-dimensional structural break predictive regressions.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The procedure has three steps. It first identifies the active predictors with a Sure Independence Canonical Screening (SICS) procedure. It then estimates the change points with a Ratio-Controlled Regression Screening (RCRS) method that allows their number to increase with the sample size. Finally, it uses information criteria to eliminate redundant breakpoints and predictors. The paper claims that this yields consistent estimation and selection of the true breakpoints and of active predictors that may be stationary or cointegrated.
What carries the argument
Sure Independence Canonical Screening (SICS) followed by Ratio-Controlled Regression Screening (RCRS) to identify sparse active predictors and change points.
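As a rough illustration of the screening idea, the sketch below ranks predictors by their absolute marginal correlation with the next-period response and keeps the top few. It is a generic sure-independence-style stand-in, not the paper's SICS statistic, which this summary does not specify. The function names (`abs_corr`, `marginal_screen`, `simulate_sparse`), the sample sizes, and the coefficient values are all assumptions made for the example.

```python
import math
import random


def abs_corr(x, y):
    """Absolute sample Pearson correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0.0 or syy == 0.0:
        return 0.0
    return abs(sxy) / math.sqrt(sxx * syy)


def marginal_screen(columns, y, keep):
    """Return the sorted indices of the `keep` predictors with the largest score."""
    scores = [abs_corr(col, y) for col in columns]
    ranked = sorted(range(len(columns)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:keep])


def simulate_sparse(n=300, p=200, active=(3, 17), seed=0):
    """Hypothetical stationary design: y[t] depends on x[t-1] for active columns only."""
    rng = random.Random(seed)
    columns = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(p)]
    y = [
        sum(2.0 * columns[j][t - 1] for j in active) + rng.gauss(0.0, 1.0)
        for t in range(1, n)
    ]
    lagged = [col[:-1] for col in columns]  # align x[t-1] with y[t]
    return lagged, y


lagged, y = simulate_sparse()
selected = marginal_screen(lagged, y, keep=10)
print(selected)  # indices retained after screening
```

The design here is stationary and has no breaks, so it says nothing about the cointegrated or multi-regime cases the paper targets. It only shows the mechanics of ranking and truncation.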
If this is right
- Consistent selection holds even when the number of change points grows with sample size.
- Active predictors that are cointegrated can still be recovered correctly.
- The information-criteria step removes redundant breakpoints and predictors after initial screening.
- Simulations and empirical applications on return data show the steps work in practice.
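The pruning step in the third bullet can be pictured with a simple information-criterion comparison. The sketch below uses a plain Gaussian BIC on a one-predictor regression to decide whether a single candidate breakpoint is redundant. The paper's actual criteria and penalty are not given in this summary, so the `bic` formula, the parameter counts, the function names, and the simulated data are assumptions for illustration.

```python
import math
import random


def ols_ssr(x, y):
    """Sum of squared residuals from regressing y on an intercept and x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx if sxx > 0.0 else 0.0
    intercept = mean_y - slope * mean_x
    return sum((b - intercept - slope * a) ** 2 for a, b in zip(x, y))


def bic(ssr, n, n_params):
    """Gaussian BIC up to a constant: n * log(SSR / n) + n_params * log(n)."""
    return n * math.log(ssr / n) + n_params * math.log(n)


def keep_break(x, y, k):
    """True if splitting the sample at index k lowers BIC relative to no break."""
    n = len(x)
    # Two coefficients without a break, four with one; the break date itself
    # is not counted as a parameter in this simplified comparison.
    bic_no_break = bic(ols_ssr(x, y), n, 2)
    bic_one_break = bic(ols_ssr(x[:k], y[:k]) + ols_ssr(x[k:], y[k:]), n, 4)
    return bic_one_break < bic_no_break


def simulate(slope_after, n=400, k=200, seed=1):
    """Hypothetical data: slope 1.0 before index k, `slope_after` from k onward."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [(1.0 if t < k else slope_after) * x[t] + rng.gauss(0.0, 0.5) for t in range(n)]
    return x, y


x_break, y_break = simulate(slope_after=-1.0)
x_flat, y_flat = simulate(slope_after=1.0)
print(keep_break(x_break, y_break, 200))  # data with a large slope change
print(keep_break(x_flat, y_flat, 200))    # data with no change in slope
```

A candidate break survives only when the fit improvement outweighs the penalty for the extra coefficients, which is the sense in which an information criterion removes redundant breakpoints.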
Where Pith is reading between the lines
- The same screening steps could be tested in other high-dimensional time-series settings with multiple regimes.
- Improved selection might translate to better forecasts when relationships shift over time.
- Extensions could explore relaxing exact sparsity or adding nonlinear break forms.
Load-bearing premise
The Sure Independence Canonical Screening and Ratio-Controlled Regression Screening steps correctly identify the sparse active predictors and change points under the high-dimensional regime with possible cointegration.
What would settle it
A high-dimensional dataset with known true sparse predictors and known change points where the procedure fails to recover them consistently would falsify the consistency claim.
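A recovery check of this kind can be sketched on simulated data where the truth is known. The example below plants one slope break in a single-predictor regression and estimates its location by minimizing the total sum of squared residuals over candidate split points. This is a standard least-squares break-date search, used here as a stand-in for RCRS, whose definition is not given in this summary. The names `estimate_single_break` and `simulate_break`, the trimming value, and all parameter values are assumptions.

```python
import random


def ols_ssr(x, y):
    """Sum of squared residuals from regressing y on an intercept and x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx if sxx > 0.0 else 0.0
    intercept = mean_y - slope * mean_x
    return sum((b - intercept - slope * a) ** 2 for a, b in zip(x, y))


def estimate_single_break(x, y, trim=20):
    """Split index minimizing the summed SSR of the two sub-sample regressions."""
    n = len(x)
    best_k, best_ssr = None, float("inf")
    # Keep at least `trim` observations on each side of the candidate split.
    for k in range(trim, n - trim + 1):
        ssr = ols_ssr(x[:k], y[:k]) + ols_ssr(x[k:], y[k:])
        if ssr < best_ssr:
            best_k, best_ssr = k, ssr
    return best_k


def simulate_break(n=400, true_k=200, seed=2):
    """Hypothetical data: slope 1.0 before `true_k`, slope -1.0 from `true_k` onward."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [(1.0 if t < true_k else -1.0) * x[t] + rng.gauss(0.0, 0.5) for t in range(n)]
    return x, y


x, y = simulate_break()
k_hat = estimate_single_break(x, y)
print(k_hat)  # estimated break index, to be compared with the planted value 200
```

A procedure that repeatedly missed planted breaks and predictors across many such designs, including high-dimensional and cointegrated ones, would count as evidence against the consistency claim. A single simulated success says little either way.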
Original abstract
Predictive regression is a crucial tool for exploring return predictability. In this study, we introduce an efficient procedure for selecting and estimating active predictors and change points in structural break predictive regression. Our approach allows the number of change points to increase with the sample size and accommodates sparse active predictors that may be stationary or cointegrated. We begin by identifying the active predictors using a Sure Independence Canonical Screening (SICS) procedure. Next, we estimate the change points through a Ratio-Controlled Regression Screening (RCRS) method. Finally, we reduce redundancy by eliminating unnecessary breakpoints and predictors using information criteria (IC). This approach allows for consistent estimation and selection of true breakpoints and active predictors. Our simulations and empirical studies demonstrate that the proposed procedure performs effectively.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a three-step procedure for high-dimensional structural break predictive regression: Sure Independence Canonical Screening (SICS) to select sparse active predictors that may be stationary or cointegrated, Ratio-Controlled Regression Screening (RCRS) to estimate a possibly diverging number of change points, and information criteria (IC) to prune redundant breakpoints and predictors. The central claim is that the procedure achieves consistent estimation and selection of the true breakpoints and active predictors, with supporting evidence from simulations and empirical studies.
Significance. If the consistency results hold, particularly the sure-screening guarantee under cointegration and exponential growth of p, the method would offer a practical tool for econometric applications involving return predictability with structural breaks and high-dimensional regressors. The explicit accommodation of cointegrated I(1) predictors and diverging breaks distinguishes it from existing screening methods that assume stationarity.
major comments (2)
- [Abstract] Abstract and theoretical development: The claim that SICS achieves sure screening (with probability approaching 1) for active predictors including cointegrated ones, while p grows exponentially in n, is asserted without any derivation, assumption set, or high-dimensional rate that extends the canonical correlation ranking to I(1) processes. Standard sure-independence screening proofs rely on weak dependence for uniform convergence of marginal utilities; cointegration can induce persistent cross-sectional dependence that violates this, rendering the subsequent RCRS and IC steps moot. This is load-bearing for the consistency result stated in the abstract.
- [Theoretical results (wherever stated)] No section supplies the required non-stationary extension or explicit growth condition on p that would keep the SICS property intact under cointegration, as required by the weakest assumption identified in the stress test. Without this, the simulation performance cannot be taken as evidence that the procedure works in the regime claimed.
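The concern about persistence can be made concrete with a small simulation. The sketch below compares the largest absolute marginal correlation between a response and predictors that are all generated independently of it, once with white-noise series and once with random walks. It does not reproduce the paper's model, where the response is a return series and the integrated predictors may be cointegrated; making the response itself a random walk is a deliberate simplification, and the function names and sizes are assumptions. It only illustrates that marginal correlations need not concentrate near zero when the series are integrated.

```python
import math
import random


def abs_corr(x, y):
    """Absolute sample Pearson correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0.0 or syy == 0.0:
        return 0.0
    return abs(sxy) / math.sqrt(sxx * syy)


def white_noise(n, rng):
    """Independent standard normal draws (a stationary series)."""
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


def random_walk(n, rng):
    """Cumulative sum of standard normal shocks (an I(1) series)."""
    level, path = 0.0, []
    for _ in range(n):
        level += rng.gauss(0.0, 1.0)
        path.append(level)
    return path


def max_null_corr(generator, n=300, p=50, seed=3):
    """Largest |corr| between a response and p predictors independent of it."""
    rng = random.Random(seed)
    y = generator(n, rng)
    return max(abs_corr(generator(n, rng), y) for _ in range(p))


print(max_null_corr(white_noise))  # irrelevant stationary predictors
print(max_null_corr(random_walk))  # irrelevant integrated predictors
```

If irrelevant predictors can produce large marginal scores, a ranking-based screen needs additional conditions to separate them from active ones, which is the gap the referee asks the authors to close.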
minor comments (1)
- [Abstract] The abstract states that simulations 'demonstrate that the proposed procedure performs effectively' but provides no details on design (e.g., dimension p, break magnitudes, cointegration strength) or metrics; these should be summarized in the main text or a table for reproducibility.
Simulated Author's Rebuttal
We thank the referee for the thorough review and for highlighting the critical gap in the theoretical support for the SICS procedure. We address the two major comments point by point below. Revisions will be made to align the claims with the available derivations.
- Referee: [Abstract] Abstract and theoretical development: The claim that SICS achieves sure screening (with probability approaching 1) for active predictors including cointegrated ones, while p grows exponentially in n, is asserted without any derivation, assumption set, or high-dimensional rate that extends the canonical correlation ranking to I(1) processes. Standard sure-independence screening proofs rely on weak dependence for uniform convergence of marginal utilities; cointegration can induce persistent cross-sectional dependence that violates this, rendering the subsequent RCRS and IC steps moot. This is load-bearing for the consistency result stated in the abstract.
  Authors: We agree that the manuscript asserts the sure-screening property for cointegrated predictors under exponential growth of p without supplying the required derivation or rate conditions. The existing proofs rely on weak dependence assumptions that do not automatically extend to the persistent dependence induced by cointegration. We will revise the abstract to state that the sure-screening guarantee is established only under stationarity, while performance for cointegrated predictors is illustrated via simulations. This removes the unsupported claim from the abstract.
  Revision: yes
- Referee: [Theoretical results (wherever stated)] No section supplies the required non-stationary extension or explicit growth condition on p that would keep the SICS property intact under cointegration, as required by the weakest assumption identified in the stress test. Without this, the simulation performance cannot be taken as evidence that the procedure works in the regime claimed.
  Authors: We concur that no section provides the non-stationary extension or the explicit growth condition on p for the cointegrated case. Simulations alone cannot substitute for the missing high-dimensional theory. We will add a clarifying paragraph in the theoretical results section that explicitly restricts the consistency claims to the stationary setting and notes the absence of a full extension to I(1) processes under exponential p.
  Revision: yes
Circularity Check
No circularity; procedures are independently proposed
The manuscript proposes two new screening procedures (SICS for active predictors and RCRS for breakpoints) followed by IC-based refinement. These steps are defined directly from the data and model assumptions without reducing to prior fitted quantities, self-citations, or ansatzes imported from the authors' earlier work. The consistency claims rest on the explicit algorithmic definitions and the stated high-dimensional regime rather than on any definitional equivalence or load-bearing self-reference. No equation or section equates a derived quantity to its own input by construction.