pith. sign in

arxiv: 2311.02299 · v5 · pith:BSGZQXPXnew · submitted 2023-11-04 · 💰 econ.EM · stat.ME

The Fragility of Sparsity

Pith reviewed 2026-05-24 06:09 UTC · model grok-4.3

classification 💰 econ.EM stat.ME
keywords sparsitylinear regressionrobustnessordinary least squarescategorical controlsempirical applicationspost-selection inference
0
0 comments X

The pith

Sparsity-based linear regression estimates shift by two standard errors or more when the coding of categorical controls changes, while ordinary least squares estimates remain unchanged.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that methods relying on sparsity for linear regression produce unstable results because arbitrary but inconsequential choices in how the regressor matrix is built can alter the estimates substantially. In three empirical applications the authors find that changing baseline categories for categorical variables moves sparsity-based point estimates by two standard errors or more. They introduce two tests that compare sparsity estimators to ordinary least squares and report rejections of the sparsity assumption in all three cases. The authors conclude that ordinary least squares delivers more robust inference at little efficiency cost unless the number of regressors approaches the sample size.

Core claim

Estimates predicated on the assumption of sparsity are fragile in two ways. Different choices of the regressor matrix which do not impact ordinary least squares estimates, such as the choice of baseline category with categorical controls, can move sparsity-based estimates by two standard errors or more. Two tests of the sparsity assumption, developed by comparing sparsity-based estimators with OLS, tend to reject the sparsity assumption in all three applications. Unless the number of regressors is comparable to or exceeds the sample size, OLS yields more robust inference at little efficiency cost.

What carries the argument

Two tests that compare sparsity-based estimators to OLS, together with documented sensitivity of those estimators to arbitrary choices in the construction of the regressor matrix such as baseline-category coding.

If this is right

  • Sparsity-based estimates can change materially with coding decisions that leave OLS unaffected.
  • The sparsity assumption is rejected by the new tests in each of the three examined applications.
  • Ordinary least squares supplies more robust inference than sparsity-based methods whenever the number of regressors is much smaller than the sample size.
  • The efficiency advantage of sparsity methods is small relative to the gain in robustness from using OLS in typical settings.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Applied researchers using sparse methods may need to document stability across multiple plausible codings of categorical variables.
  • The same fragility pattern could appear in other regularized estimators that rely on implicit selection among many regressors.
  • Routine reporting of both sparse and OLS results side by side would let readers assess sensitivity directly.

Load-bearing premise

The two tests that compare sparsity-based estimators to OLS correctly detect when the sparsity assumption is violated.

What would settle it

Finding new data sets in which the two tests fail to reject sparsity or in which sparsity-based estimates remain stable across different codings of the same categorical controls would undermine the claim of general fragility.

read the original abstract

We show, using three empirical applications, that linear regression estimates predicated on the assumption of sparsity are fragile in two ways. First, we document that different choices of the regressor matrix which do not impact ordinary least squares (OLS) estimates, such as the choice of baseline category with categorical controls, can move sparsity-based estimates by two standard errors or more. Second, we develop two tests of the sparsity assumption by comparing sparsity-based estimators with OLS. The tests tend to reject the sparsity assumption in all three applications. Unless the number of regressors is comparable to or exceeds the sample size, OLS yields more robust inference at little efficiency cost.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper claims that linear regression estimates based on sparsity assumptions are fragile in two ways, documented via three empirical applications. Different choices of the regressor matrix (e.g., baseline category for categorical controls) that leave OLS estimates unchanged can shift sparsity-based estimates by two or more standard errors. Two new tests comparing sparsity estimators to OLS reject the sparsity assumption in the applications. OLS is recommended for robust inference unless the number of regressors is comparable to or exceeds sample size.

Significance. If the tests are valid detectors of sparsity violations, the results would illustrate concrete fragility of penalized estimators to innocuous design choices and provide evidence against sparsity in typical settings, favoring OLS with minimal efficiency cost. This would contribute to the high-dimensional regression literature by emphasizing robustness considerations in applied work.

major comments (2)
  1. [Section on the two tests of the sparsity assumption] The section developing the two tests: provides no analytic derivation of asymptotic size under the exact null (sparsity holds and design satisfies the paper's conditions). There is also no Monte Carlo evidence that the tests control size under regressor matrices with categorical controls or normalizations that the paper shows affect the penalized estimators. This is load-bearing for reading the application rejections as evidence against sparsity.
  2. [Empirical applications] Empirical applications: the abstract describes the strategy and rejections but supplies no details on test construction, exact specifications, or handling of multiple testing, so the degree to which the data support the central claim cannot be verified from the provided information.
minor comments (1)
  1. Some figures or tables could benefit from clearer labeling of the exact regressor-matrix variations being compared.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive report. The comments highlight important issues regarding the validation of our proposed tests and the presentation of the empirical results. We respond to each major comment below and indicate planned revisions.

read point-by-point responses
  1. Referee: [Section on the two tests of the sparsity assumption] The section developing the two tests: provides no analytic derivation of asymptotic size under the exact null (sparsity holds and design satisfies the paper's conditions). There is also no Monte Carlo evidence that the tests control size under regressor matrices with categorical controls or normalizations that the paper shows affect the penalized estimators. This is load-bearing for reading the application rejections as evidence against sparsity.

    Authors: We agree that formal size validation is important for interpreting the rejections. The manuscript does not contain an analytic derivation of asymptotic size under the exact null, as deriving the limiting distribution of the proposed tests (which compare penalized estimators to OLS) under sparsity is technically involved and depends on the specific penalty and design conditions. However, we will add Monte Carlo evidence in the revision demonstrating that the tests control size at conventional levels under regressor matrices that include categorical controls and the normalizations highlighted in the applications. This will directly address the concern for the designs used in the paper. revision: yes

  2. Referee: Empirical applications: the abstract describes the strategy and rejections but supplies no details on test construction, exact specifications, or handling of multiple testing, so the degree to which the data support the central claim cannot be verified from the provided information.

    Authors: The abstract is intentionally concise and focuses on the high-level strategy and findings. Full details on test construction, the exact specifications (including all controls, normalizations, and penalty choices), and any handling of multiple testing across applications are provided in Sections 3, 4, and 5 of the manuscript. To improve accessibility, we will revise the abstract to briefly reference the test approach and direct readers to the main text for specifications. revision: partial

Circularity Check

0 steps flagged

No circularity; derivation and tests are self-contained

full rationale

The paper introduces two new tests that compare sparsity-based estimators to OLS and applies them directly to three external empirical datasets. The documented fragility arises from explicit sensitivity checks on regressor matrix choices (e.g., baseline categories) that leave OLS unchanged. No load-bearing step reduces by construction to a fitted parameter, self-definition, or self-citation chain; the tests are presented as original contributions without invoking prior author work as an unverified uniqueness theorem or ansatz. The central claims rest on observable empirical differences rather than tautological equivalence to inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on standard domain assumptions in linear regression and the validity of the proposed sparsity tests; no free parameters or invented entities are mentioned.

axioms (1)
  • domain assumption Standard linear regression assumptions hold so that OLS serves as a reliable benchmark
    The tests and fragility comparisons treat OLS as the reference estimator whose properties are known under conventional conditions.

pith-pipeline@v0.9.0 · 5634 in / 1204 out tokens · 51606 ms · 2026-05-24T06:09:40.911398+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Variance or Standard Deviation? Shell Geometry and Global-Scale Priors in High-Dimensional Shrinkage

    stat.ME 2026-06 unverdicted novelty 6.0

    Under a radial-power benchmark, the SD-flat prior has a one-unit asymptotic risk advantage near the origin over the variance-flat prior, with crossover in the critical regime and second-order equivalence for strong signals.

Reference graph

Works this paper leans on

2 extracted references · 2 canonical work pages · cited by 1 Pith paper

  1. [1]

    http://jmlr.org/papers/v15/javanmard14a.html Jochmans, K. (2022). Heteroscedasticity-robust inference in linear regression models with many covariates. Journal of the American Statistical Association , 117 (538), 887–896. https://doi.org/10.1080/01621459.2020.1831924 Kline, P., Saggio, R., & Sølvsten, M. (2020). Leave-out estimation of variance components...

  2. [2]

    https://doi.org/10.1111/rssb.12026 45