pith. sign in

arxiv: 2606.24228 · v1 · pith:BE264UEInew · submitted 2026-06-23 · 📊 stat.ME

Uncertainty intervals for multilevel models with missing not at random data

Pith reviewed 2026-06-25 22:52 UTC · model grok-4.3

classification 📊 stat.ME
keywords sensitivity analysismissing not at randommultilevel modelspartial identificationuncertainty intervalslinear mixed-effectsdropout biasMNAR
0
0 comments X

The pith

A sensitivity analysis method produces uncertainty intervals for parameters in multilevel models when data is missing not at random.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a method for linear multilevel models that jointly models the outcome variable and the dropout process to derive a bias adjustment term. This adjustment is computed from observed data once values are supplied for one or more sensitivity parameters that capture the MNAR mechanism. When those parameters are restricted to a researcher-specified plausible range, the method yields bounds on the parameters of interest instead of point estimates. The resulting intervals support estimation and inference under assumptions weaker than the usual missing-at-random condition. The approach is checked through simulation and applied to longitudinal data on loneliness, physical activity, and memory trajectories.

Core claim

By modeling both the outcome and the dropout risk with multilevel structures, a bias adjustment can be derived that is identifiable from observed data conditional on fixed values of sensitivity parameters; restricting those parameters to a plausible interval then partially identifies the target parameters, producing uncertainty bounds that hold under MNAR mechanisms weaker than missing at random.

What carries the argument

The bias adjustment term obtained from the joint multilevel models for outcome and dropout, evaluated conditional on the sensitivity parameters.

If this is right

  • Parameters in multilevel models can be bounded rather than point-estimated when dropout is suspected to be MNAR.
  • Inference procedures can report intervals that reflect the range of possible bias adjustments consistent with the sensitivity assumptions.
  • The method applies directly to longitudinal studies with repeated measures and attrition.
  • Results remain valid under missingness mechanisms that violate the missing-at-random assumption but lie inside the specified sensitivity range.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same bounding strategy could be adapted to nonlinear or generalized linear multilevel models by deriving analogous bias terms.
  • Domain experts could use substantive knowledge to tighten or widen the sensitivity ranges in fields such as health or social science.
  • If the resulting bounds are narrow across a wide sensitivity range, that would indicate robustness even without assuming missing at random.

Load-bearing premise

The researcher can specify a plausible range for the sensitivity parameters that contains their true values.

What would settle it

A simulation in which the true MNAR mechanism is known and the computed uncertainty intervals do not contain the true parameter values for the chosen sensitivity ranges.

read the original abstract

We propose a sensitivity analysis method for missing not at random (MNAR) data in the context of linear multilevel (mixed-effects) models. The outcome and dropout risk are both modelled using multilevel models and a bias adjustment due to MNAR data is derived. This bias can be estimated from observed data conditional on specified values of sensitivity parameter(s). Under the assumption that these parameters lie within a plausible range, the method partially identify the parameters of interest, yielding bounds for estimation and inference under assumptions weaker than missing at random. The proposed analysis is investigated in a simulation study and illustrated with an analysis of the association between loneliness and physical activity with memory trajectories, adjusting for demographic, socioeconomic, and health covariates.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The manuscript proposes a sensitivity analysis method for missing not at random (MNAR) data in linear multilevel (mixed-effects) models. Both the outcome and the dropout mechanism are modeled via multilevel structures; a bias adjustment is derived that can be estimated from the observed data conditional on researcher-specified values of one or more sensitivity parameters. Under the assumption that these parameters lie in a plausible range, the approach yields bounds (uncertainty intervals) on the parameters of interest, enabling estimation and inference under assumptions weaker than missing at random. The method is examined in a simulation study and illustrated with an analysis of the association between loneliness, physical activity, and memory trajectories, adjusting for demographic, socioeconomic, and health covariates.

Significance. If the bias-adjustment derivation and bounding procedure are valid, the work supplies a practical sensitivity-analysis framework for a common data structure in longitudinal research. Multilevel models are widely used for clustered or repeated-measures data, yet MNAR dropout is rarely addressed beyond MAR assumptions; the proposed partial-identification approach therefore fills a methodological gap. The combination of an explicit bias formula, simulation evidence, and a substantive application strengthens the contribution.

major comments (2)
  1. [§3] §3 (bias-adjustment derivation): the adjustment formula must be shown to correctly incorporate the random-effects covariance structure when both the outcome and selection models are multilevel; without an explicit step that integrates over the random effects in the conditional expectation, it is unclear whether the bias term remains consistent for the fixed-effect parameters of primary interest.
  2. [§4] §4 (simulation design): the reported coverage probabilities and interval widths are obtained under correctly specified sensitivity-parameter ranges; a supplementary set of simulations in which the true sensitivity parameters lie near the boundary of the assumed range is needed to verify that the uncertainty intervals retain their nominal coverage when the researcher-specified range is only approximately correct.
minor comments (3)
  1. [§2] The notation distinguishing the sensitivity parameters from the model parameters should be introduced once in §2 and used consistently thereafter.
  2. [Table 2] Table 2 (simulation results) would benefit from an additional column reporting the proportion of replications in which the bounds contain the true parameter under each MNAR scenario.
  3. [§5] The real-data application in §5 reports point estimates and bounds but does not state the exact range chosen for the sensitivity parameters; this information should be added for reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We address each major comment below and indicate the revisions that will be incorporated.

read point-by-point responses
  1. Referee: §3 (bias-adjustment derivation): the adjustment formula must be shown to correctly incorporate the random-effects covariance structure when both the outcome and selection models are multilevel; without an explicit step that integrates over the random effects in the conditional expectation, it is unclear whether the bias term remains consistent for the fixed-effect parameters of primary interest.

    Authors: We agree that an explicit integration step would clarify the derivation. The bias adjustment in Section 3 is obtained by conditioning on the observed data and selection indicators under the joint multilevel model for the outcome and dropout process; this implicitly integrates over the random effects via their joint covariance structure. In the revision we will insert an additional displayed equation that explicitly performs the integration over the random-effects distribution, confirming that the resulting bias term is consistent for the fixed-effect parameters of interest. revision: yes

  2. Referee: §4 (simulation design): the reported coverage probabilities and interval widths are obtained under correctly specified sensitivity-parameter ranges; a supplementary set of simulations in which the true sensitivity parameters lie near the boundary of the assumed range is needed to verify that the uncertainty intervals retain their nominal coverage when the researcher-specified range is only approximately correct.

    Authors: We concur that this additional check would strengthen the simulation evidence. We will add a supplementary simulation study in which the true sensitivity parameters are placed at the boundary (and slightly outside) of the researcher-specified ranges, and will report the resulting coverage probabilities and interval widths for the uncertainty intervals. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation is conditional on external inputs

full rationale

The paper's central method derives a bias adjustment for MNAR data in multilevel models explicitly conditional on researcher-specified sensitivity parameters that are treated as external inputs (not fitted or derived from the target data). Partial identification then follows by varying those parameters over a plausible range. No load-bearing self-citations, self-definitional steps, or fitted inputs renamed as predictions are present in the abstract or described approach; the derivation remains self-contained against the stated conditioning.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The method depends on user-specified sensitivity parameters whose plausible range is treated as an external modeling choice rather than derived from data.

free parameters (1)
  • sensitivity parameters
    User-specified values that control the magnitude of the MNAR bias adjustment.
axioms (1)
  • domain assumption The sensitivity parameters lie within a plausible range
    This assumption enables partial identification and the production of bounds.

pith-pipeline@v0.9.1-grok · 5636 in / 1213 out tokens · 29922 ms · 2026-06-25T22:52:54.420551+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

12 extracted references · 7 canonical work pages

  1. [1]

    Bates, M

    Bates D, M¨ achler M, Bolker B, Walker S. Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software. 2015;67(1):1–48. https://doi.org/10.18637/jss.v067.i01. B¨ orsch-Supan A, Brandt M, Hunkler C, Kneip T, Korbmacher J, Malter F, et al. Data resource profile: the Survey of Health, Ageing and Retirement in Europe (SHARE). International...

  2. [2]

    A systematic literature review of attrition between waves in longitudinal studies in the elderly shows a consistent pattern of dropout between differing studies

    Chatfield MD, Brayne CE, Matthews FE. A systematic literature review of attrition between waves in longitudinal studies in the elderly shows a consistent pattern of dropout between differing studies. Journal of clinical epidemiology. 2005;58(1):13–19. Demidenko E. Mixed Models: Theory and Applications. Hoboken: Wiley

  3. [3]

    Missing not at random models for latent growth curve analyses

    Enders CK. Missing not at random models for latent growth curve analyses. Psychological methods. 2011;16(1):1. Enders CK. Missing data: An update on the state of the art. Psychological Methods. 2023;30:322–339. Galarza CE, Lachos VH. MomTrunc: Moments of Folded and Doubly Truncated Multivariate Distributions. R Foundation for Statistical Computing, Vienna...

  4. [4]

    ui: Uncertainty Intervals and Sensitivity Analysis for Missing Data; 2019, https: //CRAN.R-project.org/package=ui, r package version 0.1.1

    Genb¨ ack M. ui: Uncertainty Intervals and Sensitivity Analysis for Missing Data; 2019, https: //CRAN.R-project.org/package=ui, r package version 0.1.1. Genb¨ ack M, de Luna X. Causal inference accounting for unobserved confounding after outcome regression and doubly robust estimation. Biometrics. 2019;75(2):506–515. Genb¨ ack M, Stanghellini E, de Luna X...

  5. [5]

    Additive properties of skew normal random vectors

    Gonz´ alez-Far´ ıas G, Dom´ ınguez-Molina A, Gupta AK. Additive properties of skew normal random vectors. Journal of Statistical Planning and Inference. 2004;126(2):521–534. Grilli L, Rampichini C. Selection bias in linear mixed models. METRON - International Journal of Statistics. 2010;68(3):309–329. Hammon A, Zinn S. Multiple imputation of binary multil...

  6. [6]

    Register-based data indicated nonparticipation bias in a health study among aging people

    Nummela O, Sulander T, Helakorpi S, Haapola I, Uutela A, Heinonen H, et al. Register-based data indicated nonparticipation bias in a health study among aging people. Journal of clinical epidemiology. 2011;64(12):1418–1425. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria; 2024, h...

  7. [7]

    SHARE-ERIC.: Survey of Health, Ageing and Retirement in Europe (SHARE) Wave

    https://doi.org/10.6103/SHARE.w4.900. SHARE-ERIC.: Survey of Health, Ageing and Retirement in Europe (SHARE) Wave

  8. [8]

    SHARE-ERIC.: Survey of Health, Ageing and Retirement in Europe (SHARE) Wave

    https://doi.org/10.6103/SHARE.w5.900. SHARE-ERIC.: Survey of Health, Ageing and Retirement in Europe (SHARE) Wave

  9. [9]

    SHARE-ERIC.: Survey of Health, Ageing and Retirement in Europe (SHARE) Wave

    https://doi.org/10.6103/SHARE.w6.900. SHARE-ERIC.: Survey of Health, Ageing and Retirement in Europe (SHARE) Wave

  10. [10]

    SHARE-ERIC.: Survey of Health, Ageing and Retirement in Europe (SHARE) Wave

    https://doi.org/10.6103/SHARE.w7.900. SHARE-ERIC.: Survey of Health, Ageing and Retirement in Europe (SHARE) Wave

  11. [11]

    SHARE-ERIC.: Survey of Health, Ageing and Retirement in Europe (SHARE) Wave

    https://doi.org/10.6103/SHARE.w8.900. SHARE-ERIC.: Survey of Health, Ageing and Retirement in Europe (SHARE) Wave

  12. [12]

    zobs i = 1 zmis i = 0 # ,X i ! =X iβ+ ∆ iΩ−1 i E ui

    https://doi.org/10.6103/SHARE.w9.900. Vansteelandt S, Goetghebeur E, Kenward MG, Molenberghs G. Ignorance and uncertainty regions as inferential tools in a sensitivity analysis. Statistica Sinica. 2006;16(3):953–979. Wang X, Larson MG, Liu C. Two-stage multiple imputation with a longitudinal composite variable. BMC Medical Research Methodology. 2025;25(1)...