Causal Inference with Multiple Misclassified Exposures: A Control Variate-Adjusted Calibration Weighting Approach

Brandie D. Wagner; Jordana E. Hoppe; Kayleigh P. Keller; Keith Barnatchez; Kevin P. Josey; Nandini Murali

arxiv: 2606.23656 · v1 · pith:ZOS5XKPSnew · submitted 2026-06-22 · 📊 stat.ME · stat.AP

Causal Inference with Multiple Misclassified Exposures: A Control Variate-Adjusted Calibration Weighting Approach

Nandini Murali , Keith Barnatchez , Jordana E. Hoppe , Brandie D. Wagner , Kayleigh P. Keller , Kevin P. Josey This is my paper

Pith reviewed 2026-06-26 07:12 UTC · model grok-4.3

classification 📊 stat.ME stat.AP

keywords causal inferenceexposure misclassificationcalibration weightingcontrol variatesdouble robustnessbinary exposuresclustered datacystic fibrosis

0 comments

The pith

Calibration weighting that treats misclassification as missing data, combined with control variates, produces doubly robust causal estimates for multiple binary exposures without modeling the error process.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops estimators for causal effects of multiple binary exposures, such as bacterial infections, when the exposures are observed through imperfect tests that produce misclassification. It frames the problem as one of missing data so that calibration weights deliver consistent estimates without needing to specify how the classification errors occur. A control variate step then uses the cheaper error-prone measurements to shrink variance while keeping the gold-standard consistency intact. The resulting estimator inherits double robustness from its pieces. The authors also derive an upper bound on how much variance reduction is possible when two exposures must be correctly classified together, and they demonstrate the method on cystic fibrosis data where swab measurements substantially attenuate the estimated effect of Pseudomonas on lung function.

Core claim

We develop calibration weighting and control variate estimators for causal inference with multiple misclassified binary exposures and clustered observations. The calibration approach treats misclassification as a missing data problem, achieving consistency without modelling the misclassification mechanism. The control variate adjustment integrates information from error-prone observations to reduce variance while preserving the consistency of the gold-standard estimator. We show that the resulting estimator inherits double robustness from its component estimators. We also characterize a structural ceiling on efficiency gains in the bivariate setting, where joint correct classification of bot

What carries the argument

Calibration weighting estimator that recasts misclassification as missing data, adjusted by control variates drawn from the error-prone measurements.

If this is right

The estimator remains consistent for the causal effect even if the outcome or exposure model is misspecified, provided the calibration weights are correctly formed.
Double robustness holds for the combined control-variate version.
In the two-exposure case, the maximum variance reduction is bounded by the joint probability that both exposures are correctly classified.
Application to the cystic fibrosis cohort shows swab-based estimates of the Pseudomonas effect on FEV1 are attenuated by about 69 percent relative to sputum-based estimates.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The missing-data framing may extend naturally to settings with three or more misclassified binary exposures if the calibration step can be generalized.
The efficiency ceiling implies that pairing a high-quality but expensive measure with a cheap noisy one will yield smaller gains when the two exposures are strongly associated.
The approach could be tested in other clustered observational studies where multiple risk factors are recorded with different levels of accuracy, such as electronic health records.

Load-bearing premise

Misclassification can be handled as a missing-data problem that yields consistent estimates without any model for how the errors arise.

What would settle it

A data-generating process in which the gold-standard observations are a random subsample but the probability of correct classification depends on factors unobserved in the calibration step, producing persistent bias in the weighted estimator.

Figures

Figures reproduced from arXiv: 2606.23656 by Brandie D. Wagner, Jordana E. Hoppe, Kayleigh P. Keller, Keith Barnatchez, Kevin P. Josey, Nandini Murali.

**Figure 1.** Figure 1: Bias for the joint exposure effect τ (1,1) across measurement error configurations and model specifications. 34 [PITH_FULL_IMAGE:figures/full_fig_p034_1.png] view at source ↗

**Figure 2.** Figure 2: Root mean squared error (RMSE) for the joint exposure effect [PITH_FULL_IMAGE:figures/full_fig_p035_2.png] view at source ↗

**Figure 3.** Figure 3: Empirical coverage probability of nominal 95% confidence intervals. The dashed [PITH_FULL_IMAGE:figures/full_fig_p036_3.png] view at source ↗

**Figure 4.** Figure 4: Relative efficiency of the control variate-adjusted joint estimator [PITH_FULL_IMAGE:figures/full_fig_p036_4.png] view at source ↗

**Figure 5.** Figure 5: Estimated average treatment effects of pathogen exposures on percent predicted [PITH_FULL_IMAGE:figures/full_fig_p037_5.png] view at source ↗

read the original abstract

Exposure misclassification is a common concern in studies of respiratory infections in cystic fibrosis. Throat swabs are frequently used in place of expectorated or induced sputum cultures, although they have imperfect sensitivity and specificity to detect Pseudomonas aeruginosa and Staphylococcus aureus. We develop calibration weighting and control variate estimators for causal inference with multiple misclassified binary exposures and clustered observations. The calibration approach treats misclassification as a missing data problem, achieving consistency without modelling the misclassification mechanism. The control variate adjustment integrates information from error-prone observations to reduce variance while preserving the consistency of the gold-standard estimator. We show that the resulting estimator inherits double robustness from its component estimators. We also characterize a structural ceiling on efficiency gains in the bivariate setting, where joint correct classification of both exposures limits the variance reduction achievable relative to univariate applications. Simulation studies confirm the consistency and double robustness of the proposed estimators under model misspecification. We then apply these methods to a cohort of $651$ cystic fibrosis patients ages $6$-$21$. Swab-based estimates attenuate the effect of P. aeruginosa on percent predicted FEV$_1$ by approximately $69\%$ relative to sputum-based estimates ($-2.67$ vs. $-8.52$ percentage points; $95\%$ CI for sputum: $-13.40$, $-3.63$). These findings suggest that relying on throat swabs may lead to under-treatment of P. aeruginosa infections. More broadly, the methods provide a framework for causal inference with multiple misclassified exposures.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper extends calibration weighting plus control variates to two misclassified binary exposures in clustered data and derives a bivariate efficiency bound; the CF application shows clear attenuation but the clustering adjustment needs checking.

read the letter

The core new piece is an estimator that merges calibration weighting with control variate adjustment for the two-exposure misclassification case, plus a structural bound on how much variance reduction you can get when both exposures must be correctly classified together. That bound is not just a routine extension of the univariate version.

The work does what it sets out to do on the simulation side: the abstract reports consistency and double robustness under misspecification, and the real-data example on 651 cystic fibrosis patients is concrete. The sputum-based effect of P. aeruginosa on FEV1 is roughly three times larger than the swab-based one, which is the kind of result that matters for practice.

The soft spot is the clustering. The method treats misclassification as missing data to avoid modeling the error process, but the two binary indicators are likely dependent within patients. Standard calibration equations derived under independent missingness can break when that dependence is present, and the abstract does not spell out whether the estimating equations or variance formulas were adjusted at the cluster level. If they were not, the double-robustness claim would not automatically carry over. The simulations are said to confirm performance, but without seeing the exact data-generating process for the clusters it is hard to judge how severe the issue is.

This is aimed at epidemiologists and biostatisticians who routinely face imperfect binary exposures in longitudinal or clustered studies. It is worth sending to peer review because the target problem is real, the proposed fix is new, and the application gives a clear before-and-after comparison. A referee can sort out the clustering detail and check the efficiency bound derivation.

Referee Report

2 major / 2 minor

Summary. The paper develops calibration weighting and control variate estimators for causal inference with multiple misclassified binary exposures under clustered sampling. It treats misclassification as a missing-data problem to obtain consistency without modeling the error mechanism, shows that the combined estimator inherits double robustness, derives a structural bound on efficiency gains in the bivariate case, validates the properties via simulations under misspecification, and applies the methods to cystic fibrosis data to demonstrate attenuation of the P. aeruginosa effect on FEV1 when using throat swabs versus sputum.

Significance. If the consistency and double-robustness claims hold under clustering, the work supplies a practical, assumption-light framework for a common problem in respiratory epidemiology and similar clustered studies with error-prone binary exposures. The inheritance of double robustness and the explicit characterization of the bivariate efficiency ceiling are theoretically attractive features; the real-data illustration quantifies the practical consequences of misclassification.

major comments (2)

[methods (calibration weighting derivation)] The consistency claim rests on treating misclassification as missing data so that calibration weighting delivers consistency without an explicit error model. With clustered observations and two binary exposures, the missingness indicators are plausibly dependent within clusters. The calibration equations (methods section deriving the weights) appear to be written under an independence assumption for the missingness indicators; no cluster-level calibration or cluster-robust adjustment to the estimating equations is described. This directly threatens the consistency and double-robustness inheritance when within-cluster dependence is present.
[theoretical results on double robustness] The double-robustness result is stated to be inherited from the component estimators. The paper should explicitly verify that the inheritance continues to hold once the calibration step is applied to the joint distribution of two misclassified exposures under clustering, rather than relying solely on the univariate or independent-missingness case.

minor comments (2)

[application results] The abstract and application section report point estimates and a 95% CI for the sputum-based analysis but do not state the variance estimator or whether clustering is accounted for in the reported intervals.
[notation] Notation for the multiple-exposure calibration weights and the control-variate adjustment should be introduced with a single consolidated table or display equation to improve readability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed comments. We address each major comment below, indicating the revisions we will incorporate to strengthen the manuscript.

read point-by-point responses

Referee: [methods (calibration weighting derivation)] The consistency claim rests on treating misclassification as missing data so that calibration weighting delivers consistency without an explicit error model. With clustered observations and two binary exposures, the missingness indicators are plausibly dependent within clusters. The calibration equations (methods section deriving the weights) appear to be written under an independence assumption for the missingness indicators; no cluster-level calibration or cluster-robust adjustment to the estimating equations is described. This directly threatens the consistency and double-robustness inheritance when within-cluster dependence is present.

Authors: We appreciate the referee's identification of this potential gap. The calibration equations in the current manuscript are derived marginally at the individual level to target the required moments without an explicit misclassification model. While this delivers consistency under the stated assumptions, we acknowledge that within-cluster dependence in the missingness indicators is not explicitly addressed in the estimating equations. In the revision we will augment the calibration step with cluster-robust adjustments to the estimating equations and will add a brief theoretical note clarifying that consistency is retained under cluster dependence provided the calibration variables remain correctly specified. We will also include a targeted simulation experiment with dependent missingness within clusters to confirm finite-sample behavior. revision: yes
Referee: [theoretical results on double robustness] The double-robustness result is stated to be inherited from the component estimators. The paper should explicitly verify that the inheritance continues to hold once the calibration step is applied to the joint distribution of two misclassified exposures under clustering, rather than relying solely on the univariate or independent-missingness case.

Authors: We agree that an explicit verification under joint misclassification and clustering strengthens the theoretical contribution. The current manuscript relies on the inheritance property from the univariate control-variate and calibration estimators, but does not re-derive the cancellation of bias terms for the bivariate clustered case. In the revised version we will add a short appendix subsection that explicitly shows the double-robustness property continues to hold when the calibration weights are obtained from the joint observed-data distribution and the estimating equations are cluster-robust. The argument follows the same bias-cancellation logic as the univariate case once the joint calibration moments are correctly targeted. revision: yes

Circularity Check

0 steps flagged

No circularity; derivation extends established calibration and control-variate methods without self-referential reduction

full rationale

The paper treats misclassification as a missing-data problem to obtain consistency via calibration weighting, then augments with control variates while inheriting double robustness from the component estimators. These steps rest on standard missing-data and weighting theory applied to the clustered bivariate setting; the structural efficiency ceiling is derived from the joint classification probabilities rather than from any quantity fitted inside the present manuscript. No equation or claim reduces by construction to a parameter defined by the paper's own data or to a self-citation whose validity is presupposed. The central consistency and robustness results therefore remain independent of the target dataset's fitted values.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Based on abstract only; the central claims rest on the missing-data treatment of misclassification and the inheritance of double robustness from component estimators. No free parameters, invented entities, or additional axioms are described in the provided text.

axioms (1)

domain assumption Misclassification can be treated as a missing data problem to achieve consistency without modeling the misclassification mechanism
Stated in abstract as the basis for the calibration approach

pith-pipeline@v0.9.1-grok · 5832 in / 1284 out tokens · 25673 ms · 2026-06-26T07:12:00.166741+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

41 extracted references

[1]

Cystic fibrosis

Abdulghani Sankari and Sandeep Sharma. Cystic fibrosis. InStatPearls. StatPearls Publishing, Treasure Island, FL, 2024

2024
[2]

Clinical sig- nificance of microbial infection and adaptation in cystic fibrosis.Clinical Microbiology Reviews, 24(1):29–70, 2011

Alan R Hauser, Manu Jain, Maskit Bar-Meir, and Susanna A McColley. Clinical sig- nificance of microbial infection and adaptation in cystic fibrosis.Clinical Microbiology Reviews, 24(1):29–70, 2011

2011
[3]

Exercise and airway clearance tech- niques in cystic fibrosis.Seminars in Respiratory and Critical Care Medicine, 44(2): 209–216, 2023

Nathan Ward, Kathy Stiller, and Anne E Holland. Exercise and airway clearance tech- niques in cystic fibrosis.Seminars in Respiratory and Critical Care Medicine, 44(2): 209–216, 2023. doi: 10.1055/s-0042-1758729

doi:10.1055/s-0042-1758729 2023
[4]

Pharmacologic improvement of CFTR function rapidly decreases sputum pathogen density, but lung infections generally persist.Journal of Clinical Investigation, 133(10):e167957, 2023

David P Nichols, Sarah J Morgan, Michelle Skalland, Anh T Vo, Jill M Van Dalfsen, Sachinkumar B P Singh, Wendy Ni, Lucas R Hoffman, Kailee McGeer, Sonya L Heltshe, John P Clancy, Steven M Rowe, Peter Jorth, and Pradeep K Singh. Pharmacologic improvement of CFTR function rapidly decreases sputum pathogen density, but lung infections generally persist.Journ...

doi:10.1172/jci167957 2023
[5]

New drugs, new challenges in cystic fibrosis care.European Respiratory Review, 33(173):240045, 2024

Isabelle Fajac, Pierre-R´ egis Burgel, and Cl´ emence Martin. New drugs, new challenges in cystic fibrosis care.European Respiratory Review, 33(173):240045, 2024. doi: 10. 1183/16000617.0045-2024

arXiv 2024
[6]

Sputum induction improves detection of pathogens in children with cystic fibrosis.Pediatric Pulmonology, 50(7):638–646, 2015

Jordana E Hoppe, Elinor Towler, Brandie D Wagner, Frank J Accurso, Scott D Sagel, and Edith T Zemanick. Sputum induction improves detection of pathogens in children with cystic fibrosis.Pediatric Pulmonology, 50(7):638–646, 2015. doi: 10.1002/ppul. 23150. 25

doi:10.1002/ppul 2015
[7]

Respiratory bacterial culture sampling in expectorating and non- expectorating patients with cystic fibrosis.Frontiers in Pediatrics, 6:403, 2018

Hanneke Eyns, Denis Pi´ erard, Elke De Wachter, Leo Eeckhout, Peter Vaes, and Anne Malfroot. Respiratory bacterial culture sampling in expectorating and non- expectorating patients with cystic fibrosis.Frontiers in Pediatrics, 6:403, 2018

2018
[8]

Bacterial culture underestimates lung pathogen detection and infection status in cystic fibrosis.Microbiology Spectrum, 10(5):e00419–22, 2022

Helen Gavillet, Lauren Hatfield, Damian Rivett, Andrew Jones, Anirban Maitra, Alexander Horsley, and Christopher van der Gast. Bacterial culture underestimates lung pathogen detection and infection status in cystic fibrosis.Microbiology Spectrum, 10(5):e00419–22, 2022. doi: 10.1128/spectrum.00419-22

doi:10.1128/spectrum.00419-22 2022
[9]

Measurement error in nonlinear models: A modern perspective

Raymond J Carroll, David Ruppert, Leonard A Stefanski, and Ciprian M Crainiceanu. Measurement error in nonlinear models: A modern perspective. Chapman and Hal- l/CRC, 2006

2006
[10]

Kevin P Josey, Priyanka DeSouza, Xiao Wu, Danielle Braun, and Rachel Nethery. Estimating a causal exposure response function with a continuous error-prone exposure: A study of fine particulate matter and all-cause mortality.Journal of Agricultural, Biological and Environmental Statistics, 28(1):20–41, 2023

2023
[11]

Propensity scores with misclassified treatment assignment: a likelihood-based adjustment.Biostatistics, 18(4):695–710, 2017

Danielle Braun, Malka Gorfine, Giovanni Parmigiani, Nils D Arvold, Francesca Do- minici, and Corwin Zigler. Propensity scores with misclassified treatment assignment: a likelihood-based adjustment.Biostatistics, 18(4):695–710, 2017

2017
[12]

Causal inference in the context of an error prone exposure: Air pollution and mortality.The Annals of Applied Statistics, 13(1):520–547, 2019

Xiao Wu, Danielle Braun, Marianthi-Anna Kioumourtzoglou, Christine Choirat, Qian Di, and Francesca Dominici. Causal inference in the context of an error prone exposure: Air pollution and mortality.The Annals of Applied Statistics, 13(1):520–547, 2019

2019
[13]

Causal inference for multiple contin- uous exposures via the multivariate generalized propensity score.arXiv preprint arXiv:2008.13767, 2020

Justin R Williams and Catherine M Crespi. Causal inference for multiple contin- uous exposures via the multivariate generalized propensity score.arXiv preprint arXiv:2008.13767, 2020

arXiv 2008
[14]

An imputation-based solution to using mismeasured covariates in propensity score analysis.Statistical Methods in Medical Research, 26(4):1824–1837, 2017

Yenny Webb-Vargas, Kara E Rudolph, David Lenis, Peter Murakami, and Elizabeth A 26 Stuart. An imputation-based solution to using mismeasured covariates in propensity score analysis.Statistical Methods in Medical Research, 26(4):1824–1837, 2017

2017
[15]

Towards optimal doubly robust estimation of heterogeneous causal effects.Electronic Journal of Statistics, 17(2):3008–3049, 2023

Edward H Kennedy. Towards optimal doubly robust estimation of heterogeneous causal effects.Electronic Journal of Statistics, 17(2):3008–3049, 2023. Earlier arXiv preprint: 2004.14497, 2020

arXiv 2023
[16]

Connections between survey calibration estimators and semiparametric models for incomplete data.International Statistical Review, 79(2):200–220, 2011

Thomas Lumley, Pamela A Shaw, and James Y Dai. Connections between survey calibration estimators and semiparametric models for incomplete data.International Statistical Review, 79(2):200–220, 2011

2011
[17]

All your data are always miss- ing: incorporating bias due to measurement error into the potential outcomes frame- work.International Journal of Epidemiology, 44(4):1452–1459, 2015

Jessie K Edwards, Stephen R Cole, and Daniel Westreich. All your data are always miss- ing: incorporating bias due to measurement error into the potential outcomes frame- work.International Journal of Epidemiology, 44(4):1452–1459, 2015

2015
[18]

Im- proving trial generalizability using observational studies.Biometrics, 79(2):1213–1225,

Dasom Lee, Shu Yang, Lin Dong, Xiaofei Wang, Donglin Zeng, and Jianwen Cai. Im- proving trial generalizability using observational studies.Biometrics, 79(2):1213–1225,
[19]

doi: 10.1111/biom.13609

doi:10.1111/biom.13609
[20]

Combining multiple observational data sources to estimate causal effects.Journal of the American Statistical Association, 115(531):1540–1554, 2020

Shu Yang and Peng Ding. Combining multiple observational data sources to estimate causal effects.Journal of the American Statistical Association, 115(531):1540–1554, 2020

2020
[21]

Flexible and efficient estimation of causal effects with error-prone ex- posures: a control variates approach for measurement error.Biometrics, 81(4):ujaf151, 2025

Keith Barnatchez, Rachel Nethery, Bryan E Shepherd, Giovanni Parmigiani, and Kevin P Josey. Flexible and efficient estimation of causal effects with error-prone ex- posures: a control variates approach for measurement error.Biometrics, 81(4):ujaf151, 2025

2025
[22]

Estimating causal effects of treatments in randomized and nonran- domized studies.Journal of Educational Psychology, 66(5):688–701, 1974

Donald B Rubin. Estimating causal effects of treatments in randomized and nonran- domized studies.Journal of Educational Psychology, 66(5):688–701, 1974. 27

1974
[23]

Causal interaction in factorial experiments: Application to conjoint analysis.Journal of the American Statistical Association, 114(526):529–540,

Naoki Egami and Kosuke Imai. Causal interaction in factorial experiments: Application to conjoint analysis.Journal of the American Statistical Association, 114(526):529–540,
[24]

doi: 10.1080/01621459.2018.1476246

doi:10.1080/01621459.2018.1476246 2018
[25]

Calibration estimators in survey sampling

Jean-Claude Deville and Carl-Erik S¨ arndal. Calibration estimators in survey sampling. Journal of the American Statistical Association, 87(418):376–382, 1992

1992
[26]

Semiparametric efficiency in multivariate re- gression models with missing data.Journal of the American Statistical Association, 90 (429):122–129, 1995

James M Robins and Andrea Rotnitzky. Semiparametric efficiency in multivariate re- gression models with missing data.Journal of the American Statistical Association, 90 (429):122–129, 1995

1995
[27]

Doubly robust estimation in missing data and causal inference models.Biometrics, 61(4):962–973, 2005

Heejung Bang and James M Robins. Doubly robust estimation in missing data and causal inference models.Biometrics, 61(4):962–973, 2005

2005
[28]

Doubly robust estimation of causal effects.American Journal of Epidemiology, 173(7):761–767, 2011

Michele Jonsson Funk, Daniel Westreich, Chris Wiesen, Til St¨ urmer, M Alan Brookhart, and Marie Davidian. Doubly robust estimation of causal effects.American Journal of Epidemiology, 173(7):761–767, 2011

2011
[29]

Combining multiple imputation with raking of weights: An efficient and robust approach in the setting of nearly true models.Statistics in Medicine, 40(30):6777–6791, 2021

Kyunghee Han, Pamela A Shaw, and Thomas Lumley. Combining multiple imputation with raking of weights: An efficient and robust approach in the setting of nearly true models.Statistics in Medicine, 40(30):6777–6791, 2021. doi: 10.1002/sim.9210

doi:10.1002/sim.9210 2021
[30]

A framework for covariate balance using Bregman distances.Scandinavian Journal of Statistics, 48 (3):790–816, 2021

Kevin P Josey, Elizabeth Juarez-Colunga, Fan Yang, and Debashis Ghosh. A framework for covariate balance using Bregman distances.Scandinavian Journal of Statistics, 48 (3):790–816, 2021. doi: 10.1111/sjos.12457

doi:10.1111/sjos.12457 2021
[31]

Generalized raking pro- cedures in survey sampling.Journal of the American Statistical Association, 88(423): 1013–1020, 1993

Jean-Claude Deville, Carl-Erik S¨ arndal, and Olivier Sautory. Generalized raking pro- cedures in survey sampling.Journal of the American Statistical Association, 88(423): 1013–1020, 1993

1993
[32]

Trans- 28 porting experimental results with entropy balancing.Statistics in Medicine, 40(19): 4310–4326, 2021

Kevin P Josey, Seth A Berkowitz, Debashis Ghosh, and Sridharan Raghavan. Trans- 28 porting experimental results with entropy balancing.Statistics in Medicine, 40(19): 4310–4326, 2021. doi: 10.1002/sim.9031

doi:10.1002/sim.9031 2021
[33]

A calibration approach to transportability and data-fusion with observational data.Statistics in Medicine, 41(23):4511–4531, 2022

Kevin P Josey, Fan Yang, Debashis Ghosh, and Sridharan Raghavan. A calibration approach to transportability and data-fusion with observational data.Statistics in Medicine, 41(23):4511–4531, 2022. doi: 10.1002/sim.9523

doi:10.1002/sim.9523 2022
[34]

Multivariate adaptive regression splines.The Annals of Statistics, 19(1):1–67, 1991

Jerome H Friedman. Multivariate adaptive regression splines.The Annals of Statistics, 19(1):1–67, 1991

1991
[35]

An introduction to g methods

Ashley I Naimi, Stephen R Cole, and Edward H Kennedy. An introduction to g methods. International Journal of Epidemiology, 46(2):756–762, 2017

2017
[36]

Object-oriented computation of sandwich estimators.Journal of Statis- tical Software, 16(9):1–16, 2006

Achim Zeileis. Object-oriented computation of sandwich estimators.Journal of Statis- tical Software, 16(9):1–16, 2006

2006
[37]

Efficiency of multivariate control variates in Monte Carlo simulation.Operations Research, 33(3):661–677, 1985

Reuven Y Rubinstein and Ruth Marcus. Efficiency of multivariate control variates in Monte Carlo simulation.Operations Research, 33(3):661–677, 1985

1985
[38]

Duxbury/Thomson Learning, Pacific Grove, CA, 2nd edition, 2002

George Casella and Roger L Berger.Statistical Inference. Duxbury/Thomson Learning, Pacific Grove, CA, 2nd edition, 2002

2002
[39]

Use of FEV1 in cystic fibrosis epidemiologic studies and clinical trials: a statistical perspective for the clinical researcher.Journal of Cystic Fibrosis, 16(3):318–326, 2017

Rhonda Szczesniak, Sonya L Heltshe, Sanja Stanojevic, and Nicole Mayer-Hamblett. Use of FEV1 in cystic fibrosis epidemiologic studies and clinical trials: a statistical perspective for the clinical researcher.Journal of Cystic Fibrosis, 16(3):318–326, 2017

2017
[40]

Multi- ethnic reference values for spirometry for the 3–95-yr age range: the global lung function 2012 equations.European Respiratory Journal, 40(6):1324–1343, 2012

Philip H Quanjer, Sanja Stanojevic, Tim J Cole, Xaver Baur, Graham L Hall, Bruce H Culver, Paul L Enright, John L Hankinson, Mary SM Ip, Jinping Zheng, et al. Multi- ethnic reference values for spirometry for the 3–95-yr age range: the global lung function 2012 equations.European Respiratory Journal, 40(6):1324–1343, 2012

2012
[41]

Efficient estimation of causal effects under two- 29 phase sampling with error-prone outcome and treatment measurements, 2025

Keith Barnatchez, Kevin P Josey, Nima S Hejazi, Bryan E Shepherd, Giovanni Parmigiani, and Rachel C Nethery. Efficient estimation of causal effects under two- 29 phase sampling with error-prone outcome and treatment measurements, 2025. URL https://arxiv.org/abs/2506.21777. 30 Tables Sputum (N= 5,434) Throat Swab (N= 7,537) Percent Predicted FEV1 Mean (SD)...

arXiv 2025

[1] [1]

Cystic fibrosis

Abdulghani Sankari and Sandeep Sharma. Cystic fibrosis. InStatPearls. StatPearls Publishing, Treasure Island, FL, 2024

2024

[2] [2]

Clinical sig- nificance of microbial infection and adaptation in cystic fibrosis.Clinical Microbiology Reviews, 24(1):29–70, 2011

Alan R Hauser, Manu Jain, Maskit Bar-Meir, and Susanna A McColley. Clinical sig- nificance of microbial infection and adaptation in cystic fibrosis.Clinical Microbiology Reviews, 24(1):29–70, 2011

2011

[3] [3]

Exercise and airway clearance tech- niques in cystic fibrosis.Seminars in Respiratory and Critical Care Medicine, 44(2): 209–216, 2023

Nathan Ward, Kathy Stiller, and Anne E Holland. Exercise and airway clearance tech- niques in cystic fibrosis.Seminars in Respiratory and Critical Care Medicine, 44(2): 209–216, 2023. doi: 10.1055/s-0042-1758729

doi:10.1055/s-0042-1758729 2023

[4] [4]

Pharmacologic improvement of CFTR function rapidly decreases sputum pathogen density, but lung infections generally persist.Journal of Clinical Investigation, 133(10):e167957, 2023

David P Nichols, Sarah J Morgan, Michelle Skalland, Anh T Vo, Jill M Van Dalfsen, Sachinkumar B P Singh, Wendy Ni, Lucas R Hoffman, Kailee McGeer, Sonya L Heltshe, John P Clancy, Steven M Rowe, Peter Jorth, and Pradeep K Singh. Pharmacologic improvement of CFTR function rapidly decreases sputum pathogen density, but lung infections generally persist.Journ...

doi:10.1172/jci167957 2023

[5] [5]

New drugs, new challenges in cystic fibrosis care.European Respiratory Review, 33(173):240045, 2024

Isabelle Fajac, Pierre-R´ egis Burgel, and Cl´ emence Martin. New drugs, new challenges in cystic fibrosis care.European Respiratory Review, 33(173):240045, 2024. doi: 10. 1183/16000617.0045-2024

arXiv 2024

[6] [6]

Sputum induction improves detection of pathogens in children with cystic fibrosis.Pediatric Pulmonology, 50(7):638–646, 2015

Jordana E Hoppe, Elinor Towler, Brandie D Wagner, Frank J Accurso, Scott D Sagel, and Edith T Zemanick. Sputum induction improves detection of pathogens in children with cystic fibrosis.Pediatric Pulmonology, 50(7):638–646, 2015. doi: 10.1002/ppul. 23150. 25

doi:10.1002/ppul 2015

[7] [7]

Respiratory bacterial culture sampling in expectorating and non- expectorating patients with cystic fibrosis.Frontiers in Pediatrics, 6:403, 2018

Hanneke Eyns, Denis Pi´ erard, Elke De Wachter, Leo Eeckhout, Peter Vaes, and Anne Malfroot. Respiratory bacterial culture sampling in expectorating and non- expectorating patients with cystic fibrosis.Frontiers in Pediatrics, 6:403, 2018

2018

[8] [8]

Bacterial culture underestimates lung pathogen detection and infection status in cystic fibrosis.Microbiology Spectrum, 10(5):e00419–22, 2022

Helen Gavillet, Lauren Hatfield, Damian Rivett, Andrew Jones, Anirban Maitra, Alexander Horsley, and Christopher van der Gast. Bacterial culture underestimates lung pathogen detection and infection status in cystic fibrosis.Microbiology Spectrum, 10(5):e00419–22, 2022. doi: 10.1128/spectrum.00419-22

doi:10.1128/spectrum.00419-22 2022

[9] [9]

Measurement error in nonlinear models: A modern perspective

Raymond J Carroll, David Ruppert, Leonard A Stefanski, and Ciprian M Crainiceanu. Measurement error in nonlinear models: A modern perspective. Chapman and Hal- l/CRC, 2006

2006

[10] [10]

Kevin P Josey, Priyanka DeSouza, Xiao Wu, Danielle Braun, and Rachel Nethery. Estimating a causal exposure response function with a continuous error-prone exposure: A study of fine particulate matter and all-cause mortality.Journal of Agricultural, Biological and Environmental Statistics, 28(1):20–41, 2023

2023

[11] [11]

Propensity scores with misclassified treatment assignment: a likelihood-based adjustment.Biostatistics, 18(4):695–710, 2017

Danielle Braun, Malka Gorfine, Giovanni Parmigiani, Nils D Arvold, Francesca Do- minici, and Corwin Zigler. Propensity scores with misclassified treatment assignment: a likelihood-based adjustment.Biostatistics, 18(4):695–710, 2017

2017

[12] [12]

Causal inference in the context of an error prone exposure: Air pollution and mortality.The Annals of Applied Statistics, 13(1):520–547, 2019

Xiao Wu, Danielle Braun, Marianthi-Anna Kioumourtzoglou, Christine Choirat, Qian Di, and Francesca Dominici. Causal inference in the context of an error prone exposure: Air pollution and mortality.The Annals of Applied Statistics, 13(1):520–547, 2019

2019

[13] [13]

Causal inference for multiple contin- uous exposures via the multivariate generalized propensity score.arXiv preprint arXiv:2008.13767, 2020

Justin R Williams and Catherine M Crespi. Causal inference for multiple contin- uous exposures via the multivariate generalized propensity score.arXiv preprint arXiv:2008.13767, 2020

arXiv 2008

[14] [14]

An imputation-based solution to using mismeasured covariates in propensity score analysis.Statistical Methods in Medical Research, 26(4):1824–1837, 2017

Yenny Webb-Vargas, Kara E Rudolph, David Lenis, Peter Murakami, and Elizabeth A 26 Stuart. An imputation-based solution to using mismeasured covariates in propensity score analysis.Statistical Methods in Medical Research, 26(4):1824–1837, 2017

2017

[15] [15]

Towards optimal doubly robust estimation of heterogeneous causal effects.Electronic Journal of Statistics, 17(2):3008–3049, 2023

Edward H Kennedy. Towards optimal doubly robust estimation of heterogeneous causal effects.Electronic Journal of Statistics, 17(2):3008–3049, 2023. Earlier arXiv preprint: 2004.14497, 2020

arXiv 2023

[16] [16]

Connections between survey calibration estimators and semiparametric models for incomplete data.International Statistical Review, 79(2):200–220, 2011

Thomas Lumley, Pamela A Shaw, and James Y Dai. Connections between survey calibration estimators and semiparametric models for incomplete data.International Statistical Review, 79(2):200–220, 2011

2011

[17] [17]

All your data are always miss- ing: incorporating bias due to measurement error into the potential outcomes frame- work.International Journal of Epidemiology, 44(4):1452–1459, 2015

Jessie K Edwards, Stephen R Cole, and Daniel Westreich. All your data are always miss- ing: incorporating bias due to measurement error into the potential outcomes frame- work.International Journal of Epidemiology, 44(4):1452–1459, 2015

2015

[18] [18]

Im- proving trial generalizability using observational studies.Biometrics, 79(2):1213–1225,

Dasom Lee, Shu Yang, Lin Dong, Xiaofei Wang, Donglin Zeng, and Jianwen Cai. Im- proving trial generalizability using observational studies.Biometrics, 79(2):1213–1225,

[19] [19]

doi: 10.1111/biom.13609

doi:10.1111/biom.13609

[20] [20]

Combining multiple observational data sources to estimate causal effects.Journal of the American Statistical Association, 115(531):1540–1554, 2020

Shu Yang and Peng Ding. Combining multiple observational data sources to estimate causal effects.Journal of the American Statistical Association, 115(531):1540–1554, 2020

2020

[21] [21]

Flexible and efficient estimation of causal effects with error-prone ex- posures: a control variates approach for measurement error.Biometrics, 81(4):ujaf151, 2025

Keith Barnatchez, Rachel Nethery, Bryan E Shepherd, Giovanni Parmigiani, and Kevin P Josey. Flexible and efficient estimation of causal effects with error-prone ex- posures: a control variates approach for measurement error.Biometrics, 81(4):ujaf151, 2025

2025

[22] [22]

Estimating causal effects of treatments in randomized and nonran- domized studies.Journal of Educational Psychology, 66(5):688–701, 1974

Donald B Rubin. Estimating causal effects of treatments in randomized and nonran- domized studies.Journal of Educational Psychology, 66(5):688–701, 1974. 27

1974

[23] [23]

Causal interaction in factorial experiments: Application to conjoint analysis.Journal of the American Statistical Association, 114(526):529–540,

Naoki Egami and Kosuke Imai. Causal interaction in factorial experiments: Application to conjoint analysis.Journal of the American Statistical Association, 114(526):529–540,

[24] [24]

doi: 10.1080/01621459.2018.1476246

doi:10.1080/01621459.2018.1476246 2018

[25] [25]

Calibration estimators in survey sampling

Jean-Claude Deville and Carl-Erik S¨ arndal. Calibration estimators in survey sampling. Journal of the American Statistical Association, 87(418):376–382, 1992

1992

[26] [26]

Semiparametric efficiency in multivariate re- gression models with missing data.Journal of the American Statistical Association, 90 (429):122–129, 1995

James M Robins and Andrea Rotnitzky. Semiparametric efficiency in multivariate re- gression models with missing data.Journal of the American Statistical Association, 90 (429):122–129, 1995

1995

[27] [27]

Doubly robust estimation in missing data and causal inference models.Biometrics, 61(4):962–973, 2005

Heejung Bang and James M Robins. Doubly robust estimation in missing data and causal inference models.Biometrics, 61(4):962–973, 2005

2005

[28] [28]

Doubly robust estimation of causal effects.American Journal of Epidemiology, 173(7):761–767, 2011

Michele Jonsson Funk, Daniel Westreich, Chris Wiesen, Til St¨ urmer, M Alan Brookhart, and Marie Davidian. Doubly robust estimation of causal effects.American Journal of Epidemiology, 173(7):761–767, 2011

2011

[29] [29]

Combining multiple imputation with raking of weights: An efficient and robust approach in the setting of nearly true models.Statistics in Medicine, 40(30):6777–6791, 2021

Kyunghee Han, Pamela A Shaw, and Thomas Lumley. Combining multiple imputation with raking of weights: An efficient and robust approach in the setting of nearly true models.Statistics in Medicine, 40(30):6777–6791, 2021. doi: 10.1002/sim.9210

doi:10.1002/sim.9210 2021

[30] [30]

A framework for covariate balance using Bregman distances.Scandinavian Journal of Statistics, 48 (3):790–816, 2021

Kevin P Josey, Elizabeth Juarez-Colunga, Fan Yang, and Debashis Ghosh. A framework for covariate balance using Bregman distances.Scandinavian Journal of Statistics, 48 (3):790–816, 2021. doi: 10.1111/sjos.12457

doi:10.1111/sjos.12457 2021

[31] [31]

Generalized raking pro- cedures in survey sampling.Journal of the American Statistical Association, 88(423): 1013–1020, 1993

Jean-Claude Deville, Carl-Erik S¨ arndal, and Olivier Sautory. Generalized raking pro- cedures in survey sampling.Journal of the American Statistical Association, 88(423): 1013–1020, 1993

1993

[32] [32]

Trans- 28 porting experimental results with entropy balancing.Statistics in Medicine, 40(19): 4310–4326, 2021

Kevin P Josey, Seth A Berkowitz, Debashis Ghosh, and Sridharan Raghavan. Trans- 28 porting experimental results with entropy balancing.Statistics in Medicine, 40(19): 4310–4326, 2021. doi: 10.1002/sim.9031

doi:10.1002/sim.9031 2021

[33] [33]

A calibration approach to transportability and data-fusion with observational data.Statistics in Medicine, 41(23):4511–4531, 2022

Kevin P Josey, Fan Yang, Debashis Ghosh, and Sridharan Raghavan. A calibration approach to transportability and data-fusion with observational data.Statistics in Medicine, 41(23):4511–4531, 2022. doi: 10.1002/sim.9523

doi:10.1002/sim.9523 2022

[34] [34]

Multivariate adaptive regression splines.The Annals of Statistics, 19(1):1–67, 1991

Jerome H Friedman. Multivariate adaptive regression splines.The Annals of Statistics, 19(1):1–67, 1991

1991

[35] [35]

An introduction to g methods

Ashley I Naimi, Stephen R Cole, and Edward H Kennedy. An introduction to g methods. International Journal of Epidemiology, 46(2):756–762, 2017

2017

[36] [36]

Object-oriented computation of sandwich estimators.Journal of Statis- tical Software, 16(9):1–16, 2006

Achim Zeileis. Object-oriented computation of sandwich estimators.Journal of Statis- tical Software, 16(9):1–16, 2006

2006

[37] [37]

Efficiency of multivariate control variates in Monte Carlo simulation.Operations Research, 33(3):661–677, 1985

Reuven Y Rubinstein and Ruth Marcus. Efficiency of multivariate control variates in Monte Carlo simulation.Operations Research, 33(3):661–677, 1985

1985

[38] [38]

Duxbury/Thomson Learning, Pacific Grove, CA, 2nd edition, 2002

George Casella and Roger L Berger.Statistical Inference. Duxbury/Thomson Learning, Pacific Grove, CA, 2nd edition, 2002

2002

[39] [39]

Use of FEV1 in cystic fibrosis epidemiologic studies and clinical trials: a statistical perspective for the clinical researcher.Journal of Cystic Fibrosis, 16(3):318–326, 2017

Rhonda Szczesniak, Sonya L Heltshe, Sanja Stanojevic, and Nicole Mayer-Hamblett. Use of FEV1 in cystic fibrosis epidemiologic studies and clinical trials: a statistical perspective for the clinical researcher.Journal of Cystic Fibrosis, 16(3):318–326, 2017

2017

[40] [40]

Multi- ethnic reference values for spirometry for the 3–95-yr age range: the global lung function 2012 equations.European Respiratory Journal, 40(6):1324–1343, 2012

Philip H Quanjer, Sanja Stanojevic, Tim J Cole, Xaver Baur, Graham L Hall, Bruce H Culver, Paul L Enright, John L Hankinson, Mary SM Ip, Jinping Zheng, et al. Multi- ethnic reference values for spirometry for the 3–95-yr age range: the global lung function 2012 equations.European Respiratory Journal, 40(6):1324–1343, 2012

2012

[41] [41]

Efficient estimation of causal effects under two- 29 phase sampling with error-prone outcome and treatment measurements, 2025

Keith Barnatchez, Kevin P Josey, Nima S Hejazi, Bryan E Shepherd, Giovanni Parmigiani, and Rachel C Nethery. Efficient estimation of causal effects under two- 29 phase sampling with error-prone outcome and treatment measurements, 2025. URL https://arxiv.org/abs/2506.21777. 30 Tables Sputum (N= 5,434) Throat Swab (N= 7,537) Percent Predicted FEV1 Mean (SD)...

arXiv 2025