Estimating treatment duration effects via clone-censor-weight: a breast cancer case study
Pith reviewed 2026-05-20 03:41 UTC · model grok-4.3
The pith
The cloning-censoring-weighting framework emulates target trials to estimate effects of different treatment durations in observational survival data.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Under the stated assumptions, cloning individuals to represent each treatment duration strategy, censoring them when they deviate from the assigned strategy, and weighting to account for the induced censoring produces consistent estimates of the duration effects; this holds for both baseline-only confounding and time-varying confounding settings, as verified in simulations and illustrated in the breast cancer data where estimates for 2 versus 5 years of tamoxifen carry wide uncertainty.
What carries the argument
The cloning-censoring-weighting (CCW) framework, which creates copies of each patient record for each strategy, artificially censors records at the first deviation from the assigned duration, and applies inverse probability of censoring weights to recover the target trial contrast.
If this is right
- After cloning and censoring, doubly robust estimators remain consistent even if either the outcome or censoring model is misspecified.
- In the breast cancer cohort the 2-year strategy has limited support, which widens uncertainty and requires sensitivity checks.
- The framework extends naturally to other static duration questions in longitudinal observational studies with time-varying covariates.
- Simulation results indicate that misspecification of the censoring model increases variability more than bias when the other models are correct.
Where Pith is reading between the lines
- The same cloning step could be used to study duration effects in other chronic conditions where treatment is intended to continue until an event or a fixed horizon.
- Future work could test whether the distinction between artificial and natural censoring remains clear when follow-up lengths vary widely across registries.
- Combining CCW with machine-learning nuisance models might reduce the uncertainty seen in the small-event breast cancer application.
Load-bearing premise
Treatment admissibility, relaxed intervention rules, and the correct separation of artificial from natural censoring must hold in the observed data.
What would settle it
If a randomized trial of 2 versus 5 years of adjuvant tamoxifen produces materially different survival curves from those obtained via CCW on the same population, the framework's estimates would be called into question.
read the original abstract
In this work, we study the estimation of treatment duration effects in observational survival data, where treatment and covariate histories evolve over time and longer observed durations are only attainable among individuals who remain event-free and under follow-up, leading to immortal time bias under naive analyses. The cloning-censoring-weighting (CCW) framework provides a practical approach to emulate target trials of treatment duration strategies, but several methodological aspects remain insufficiently understood. We focus on static treatment duration strategies under two settings of increasing complexity: baseline confounding only, and confounding with time-varying covariates. We formalize the assumptions underlying CCW, with particular emphasis on treatment admissibility, relaxed intervention rules, and the distinction between artificial and natural censoring. We then compare several estimation approaches after cloning and censoring, including inverse probability of censoring weighting (IPCW), the G-formula, and doubly robust estimators, through simulation studies assessing robustness, variability, and sensitivity to censoring model misspecification. Finally, we apply the framework to a Breast Cancer cohort to emulate a target trial comparing 2 versus 5 years of adjuvant tamoxifen in early stage breast cancer. Due to the small number of events and limited support for the 2-year strategy, estimates are associated with substantial uncertainty. These findings highlight both the practical relevance and the limitations of CCW, and underscore the importance of sensitivity analyses in complex longitudinal observational settings.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript claims that the cloning-censoring-weighting (CCW) framework offers a practical way to emulate target trials for static treatment duration strategies in observational survival data subject to immortal time bias and time-varying confounding. It formalizes assumptions including treatment admissibility, relaxed interventions, and the distinction between artificial and natural censoring; compares IPCW, G-formula, and doubly robust estimators via simulations that assess robustness to censoring-model misspecification; and applies the approach to a breast cancer cohort to compare 2 versus 5 years of adjuvant tamoxifen, reporting substantial uncertainty attributable to limited support for the shorter strategy.
Significance. If the formalization and simulation results hold, the work supplies useful methodological guidance for applying CCW to duration questions in longitudinal observational settings. The simulations provide concrete evidence on estimator behavior under misspecification, while the breast cancer example illustrates both the relevance of the framework and the practical limits imposed by positivity and sample size. Explicit acknowledgment of these limits and the call for sensitivity analyses strengthen the contribution to causal inference practice in pharmacoepidemiology.
major comments (1)
- [Application section (breast cancer cohort)] Application section (breast cancer cohort): the abstract states that limited support for the 2-year strategy produces estimates with substantial uncertainty. To substantiate the claim that CCW remains practically usable, the manuscript must report diagnostics for the positivity assumption (e.g., the distribution of inverse-probability weights for the 2-year arm, the proportion of covariate histories with near-zero probability of following the 2-year regime, or any truncation rules applied). Without these, it is impossible to distinguish whether the reported uncertainty arises solely from few events or is inflated by unstable weights, directly affecting the central claim of practical applicability.
minor comments (2)
- [Simulation studies section] Simulation studies section: the description of the data-generating mechanisms and the specific parameter values used to induce censoring-model misspecification should be expanded so that readers can reproduce the reported robustness findings.
- Notation: ensure that all acronyms (CCW, IPCW, DR) are defined at first appearance in the main text and that the distinction between artificial and natural censoring is illustrated with a small numerical example.
Simulated Author's Rebuttal
We thank the referee for the thoughtful review and for highlighting the need for explicit positivity diagnostics in the breast cancer application. We agree that these details are important for interpreting the reported uncertainty and for supporting the claim of practical applicability. We address the major comment below and will incorporate the requested information in the revised manuscript.
read point-by-point responses
-
Referee: Application section (breast cancer cohort): the abstract states that limited support for the 2-year strategy produces estimates with substantial uncertainty. To substantiate the claim that CCW remains practically usable, the manuscript must report diagnostics for the positivity assumption (e.g., the distribution of inverse-probability weights for the 2-year arm, the proportion of covariate histories with near-zero probability of following the 2-year regime, or any truncation rules applied). Without these, it is impossible to distinguish whether the reported uncertainty arises solely from few events or is inflated by unstable weights, directly affecting the central claim of practical applicability.
Authors: We agree that reporting positivity diagnostics is necessary to clarify the sources of uncertainty. In the revised manuscript we will add a dedicated paragraph (or short subsection) in the application section that presents: (i) summary statistics and the empirical distribution of the inverse-probability-of-censoring weights for the 2-year arm (mean, median, 95th percentile, maximum); (ii) the proportion of observed covariate histories whose estimated probability of following the 2-year regime falls below a small threshold (e.g., 0.01 or 0.05); and (iii) any weight truncation or stabilization rules that were applied. These additions will allow readers to assess whether the wide confidence intervals are driven primarily by the small number of events or by unstable weights. We believe this will strengthen rather than weaken the central claim that CCW is practically usable while transparently acknowledging its limits in this data set. revision: yes
Circularity Check
No circularity: CCW estimates derived from data under external assumptions
full rationale
The paper formalizes standard causal assumptions for the CCW framework (treatment admissibility, artificial vs natural censoring), evaluates IPCW/G-formula/DR estimators via independent simulation studies, and applies them to the breast cancer cohort to produce estimates with reported uncertainty. No step reduces a claimed result to a fitted parameter or self-citation by construction; the target-trial emulation and numerical outputs are obtained directly from the observational data under the stated (non-derived) assumptions.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption No unmeasured confounding for treatment and censoring processes
- domain assumption Treatment admissibility and relaxed intervention rules hold
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
cloning–censoring–weighting (CCW) framework ... inverse probability of censoring weighting (IPCW), the G-formula, and doubly robust estimators
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanabsolute_floor_iff_bare_distinguishability unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Positivity (treatment) P(¯A∈Ad|X)>0 ... Positivity (censoring)
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Sox, Harold C. and Lewis, Roger J. , title =. JAMA , volume =. 2016 , month =. doi:10.1001/jama.2016.11409 , url =
- [2]
-
[3]
Novel clinical trial designs to improve the efficiency of research
Sessler, Daniel I and Myles, Paul S. Novel clinical trial designs to improve the efficiency of research. Anesthesiology
-
[4]
Campbell, Donald T. , doi =. Factors relevant to the validity of experiments in social settings , url =. Psychological Bulletin , keywords =
-
[5]
Annals of internal medicine , volume=
The revised CONSORT statement for reporting randomized trials: explanation and elaboration , author=. Annals of internal medicine , volume=. 2001 , publisher=
work page 2001
- [6]
- [7]
- [8]
-
[9]
Variable selection for propensity score models , Author =. Am J Epidemiol , Year =
-
[10]
Jerzy Splawa-Neyman and D. M. Dabrowska and T. P. Speed , title =. Statistical Science , number =. 1990 , doi =
work page 1990
-
[11]
Yilun Du, Shuang Li, Antonio Torralba, Joshua B Tenenbaum, and Igor Mordatch
Chernozhukov, Victor and Chetverikov, Denis and Demirer, Mert and Duflo, Esther and Hansen, Christian and Newey, Whitney and Robins, James , title = ". The Econometrics Journal , volume =. 2018 , month =. doi:10.1111/ectj.12097 , url =
-
[12]
Efficient estimation of average treatment effects using the estimated propensity score , Author =. Econometrica , Year =
-
[13]
Imbens and Stefan Wager , title=
Susan Athey and Guido W. Imbens and Stefan Wager , title=. Journal of the Royal Statistical Society Series B , year=2018, volume=. doi:10.1111/rssb.12268 , abstract=
-
[14]
Journal of the Royal Statistical Society Series B , volume =. 2013 , pages =
work page 2013
-
[15]
American journal of epidemiology , volume=
Using big data to emulate a target trial when a randomized trial is not available , author=. American journal of epidemiology , volume=. 2016 , publisher=
work page 2016
-
[16]
Mathematical Modelling , Year =
A new approach to causal inference in mortality studies with a sustained exposure period--application to control of the healthy worker survivor effect , Author =. Mathematical Modelling , Year =
-
[17]
The limitations of using randomised controlled trials as a basis for developing treatment guidelines
Mulder, Roger and Singh, Ajeet B and Hamilton, Amber and Das, Pritha and Outhred, Tim and Morris, Grace and Bassett, Darryl and Baune, Bernhard T and Berk, Michael and Boyce, Philip and Lyndon, Bill and Parker, Gordon and Malhi, Gin S. The limitations of using randomised controlled trials as a basis for developing treatment guidelines. Evid. Based. Ment. Health
-
[18]
Expert Opinion on Pharmacotherapy , year=
The failure of torcetrapib: is there a case for independent preclinical and clinical testing? , author=. Expert Opinion on Pharmacotherapy , year=
-
[19]
Van Spall, Harriette G. C. and Toren, Andrew and Kiss, Alex and Fowler, Robert A. , title =. JAMA , volume =. 2007 , month =. doi:10.1001/jama.297.11.1233 , url =
-
[20]
Edelman, Steven V. and Polonsky, William H. , title =. Diabetes Care , volume =. 2017 , month =. doi:10.2337/dc16-1974 , url =
-
[21]
and Tuttle, Edward and Tan, Ruo-Ding and Huynh, Johnny and Yee, John and Edelman, Steven V
Carls, Ginger S. and Tuttle, Edward and Tan, Ruo-Ding and Huynh, Johnny and Yee, John and Edelman, Steven V. and Polonsky, William H. , title =. Diabetes Care , volume =. 2017 , month =. doi:10.2337/dc16-2725 , url =
- [22]
-
[23]
Kennedy-Martin, Tessa and Curtis, Sarah and Faries, Douglas and Robinson, Susan and Johnston, Joseph. A literature review on the representativeness of randomized controlled trial samples and implications for the external validity of trial results. Trials
-
[24]
Journal of clinical epidemiology , year=
PRECIS-2 in perspective: what is next for pragmatic trials? , author=. Journal of clinical epidemiology , year=
-
[25]
The annals of statistics , pages=
Large sample behaviour of the product-limit estimator on the whole line , author=. The annals of statistics , pages=. 1983 , publisher=
work page 1983
-
[26]
BMC medical research methodology , volume=
Restricted mean survival time: an alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome , author=. BMC medical research methodology , volume=. 2013 , publisher=
work page 2013
-
[27]
Hern \'a n, M A and Brumback, B and Robins, J M. Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. Epidemiology
-
[28]
Programme CANTO - Recherche sur le cancer du sein pour r\'eduire la toxicit\'e du traitement , year =
-
[29]
Vaz-Luis, Ines and Cottu, Paul and Mesleard, Christel and Martin, Anne Laure and Dumas, Agnes and Dauchy, Sarah and Tredan, Olivier and Levy, Christelle and Adnet, Johan and Rousseau Tsangaris, Marina and Andre, Fabrice and Arveux, Patrick , title =. ESMO Open , year =
- [30]
-
[31]
and Gran, Jon Michael and Seaman, Shaun R
Keogh, Ruth H. and Gran, Jon Michael and Seaman, Shaun R. and Davies, Gwyneth and Vansteelandt, Stijn , title =. Statistics in Medicine , year =
-
[32]
Zhang, Mingyuan and Joffe, Marshall M. and Small, Dylan S. , title =. The Annals of Statistics , year =
- [33]
-
[34]
Hemant Ishwaran and Udaya B. Kogalur , keywords =. Consistency of random survival forests , journal =. 2010 , issn =. doi:https://doi.org/10.1016/j.spl.2010.02.020 , url =
-
[35]
American Journal of Cancer Research , year =
Delgado, Amanda and Guddati, Achuta Kumar , title =. American Journal of Cancer Research , year =
-
[36]
Journal of Thoracic Oncology , year =
Le-Rademacher, Jennifer and Wang, Xiaofei , title =. Journal of Thoracic Oncology , year =
-
[37]
Pharmacoepidemiology and Drug Safety , year =
Suissa, Samy , title =. Pharmacoepidemiology and Drug Safety , year =
-
[38]
Hern. Specifying a Target Trial Prevents Immortal Time Bias and Other Self-Inflicted Injuries in Observational Analyses , journal =. 2016 , volume =
work page 2016
-
[39]
A Second Chance to Get Causal Inference Right: A Classification of Data Science Tasks , journal =
Hern. A Second Chance to Get Causal Inference Right: A Classification of Data Science Tasks , journal =. 2019 , volume =
work page 2019
-
[40]
Review of the Reporting of Survival Analyses within Randomised Controlled Trials and the Implications for Meta-Analysis , year =. PLOS ONE , publisher =. doi:10.1371/journal.pone.0154870 , author =
-
[41]
Patricia M. Grambsch and Terry M. Therneau , title =. Biometrika , year =
- [42]
-
[43]
Literature review on methods for addressing non-proportional hazards in oncology clinical trials , year =
-
[44]
Howe, Chanelle J. and Cole, Stephen R. and Lau, Bryan and Napravnik, Sonia and Eron, Joseph J. , title =. Epidemiology , year =
-
[45]
Lesko, Catherine R. and Edwards, Jessie K. and Cole, Stephen R. and Moore, Richard D. and Lau, Bryan , title =. American Journal of Epidemiology , year =
-
[46]
Lesko, Catherine R. and Edwards, Jessie K. and Moore, Richard D. and Lau, Bryan , title =. Epidemiology , year =
-
[47]
Fleming, Thomas R. and Rothmann, Mark D. and Lu, Huaning L. , title =. Journal of Clinical Oncology , year =
-
[48]
Calibrating confounding strength in sensitivity models for weighting estimators: a comparative review and a new method , author=. 2026 , eprint=
work page 2026
-
[49]
Van Lancker, Kelly and Dukes, Oliver and Vansteelandt, Stijn , title =. Biometrics , volume =. doi:https://doi.org/10.1111/biom.13889 , url =. https://onlinelibrary.wiley.com/doi/pdf/10.1111/biom.13889 , abstract =
-
[50]
Covariate adjustment in randomized controlled trials: General concepts and practical considerations
Van Lancker, Kelly and Bretz, Frank and Dukes, Oliver. Covariate adjustment in randomized controlled trials: General concepts and practical considerations. Clin. Trials
-
[51]
Toward causal inference with interference
Hudgens, Michael G and Halloran, M Elizabeth. Toward causal inference with interference. J. Am. Stat. Assoc
-
[52]
Causal Inference: A Tale of Three Frameworks , volume =
Wang, Linbo and Richardson, Thomas and Robins, James , year =. Causal Inference: A Tale of Three Frameworks , volume =. Journal of Data Science , doi =
-
[53]
Loudon, Kirsty and Treweek, Shaun and Sullivan, Frank and Donnan, Peter and Thorpe, Kevin E and Zwarenstein, Merrick , title =. 2015 , doi =. https://www.bmj.com/content/350/bmj.h2147.full.pdf , journal =
work page 2015
-
[54]
Assessing the generalizability of randomized trial results to target populations
Stuart, Elizabeth A and Bradshaw, Catherine P and Leaf, Philip J. Assessing the generalizability of randomized trial results to target populations. Prev. Sci
-
[55]
Annals of Translational Medicine , volume =
Zhongheng Zhang , title =. Annals of Translational Medicine , volume =. 2016 , keywords =
work page 2016
-
[56]
Borrowing strength: Theory powering applications--a Festschrift for Lawrence D
High-dimensional variable selection for Cox’s proportional hazards model , author=. Borrowing strength: Theory powering applications--a Festschrift for Lawrence D. Brown , volume=. 2010 , publisher=
work page 2010
-
[57]
Kristensen, G. and Perren, T. and Qian, W. and Pfisterer, J. and Ledermann, J. A. and Joly, F. and Carey, M. S. and Beale, P. J. and Cervantes, A. and Oza, A. M. and null, null , title =. Journal of Clinical Oncology , volume =. 2011 , doi =. https://ascopubs.org/doi/pdf/10.1200/jco.2011.29.18_suppl.lba5006 , abstract =
-
[58]
Journal of the Royal Statistical Society: Series B , year =
Jiang, Runchao and Lu, Wenbin and Song, Rui and Davidian, Marie , title =. Journal of the Royal Statistical Society: Series B , year =
-
[59]
Cho, Hunyong and Holloway, Shannon T. and Moodie, Erica E. M. and Kosorok, Michael R. , title =. Biometrika , year =
-
[60]
Tony S. Mok and Yi-Long Wu and Sumitra Thongprasert and Chih-Hsin Yang and Da-Tong Chu and Nagahiro Saijo and Patrapim Sunpaweravong and Baohui Han and Benjamin Margono and Yukito Ichinose and Yutaka Nishiwaki and Yuichiro Ohe and Jin-Ji Yang and Busyamas Chewaskulyong and Haiyi Jiang and Emma L. Duffield and Claire L. Watkins and Alison A. Armour and Mas...
-
[61]
How hazard ratios can mislead and why it matters in practice
Dumas, Elise and Stensrud, Mats J. How hazard ratios can mislead and why it matters in practice. Eur. J. Epidemiol
-
[62]
arXiv preprint arXiv:2208.07614 , year=
Reweighting the RCT for generalization: finite sample error and variable selection , author=. arXiv preprint arXiv:2208.07614 , year=
-
[63]
Pharmaceutical Statistics , volume =
Wang, Jixian , title =. Pharmaceutical Statistics , volume =. doi:https://doi.org/10.1002/pst.1834 , url =. https://onlinelibrary.wiley.com/doi/pdf/10.1002/pst.1834 , abstract =
-
[64]
Biometrical Journal , volume =
Witte, Janine and Didelez, Vanessa , title =. Biometrical Journal , volume =. doi:https://doi.org/10.1002/bimj.201700294 , url =. https://onlinelibrary.wiley.com/doi/pdf/10.1002/bimj.201700294 , abstract =
-
[65]
Sören R. Künzel and Jasjeet S. Sekhon and Peter J. Bickel and Bin Yu , title =. Proceedings of the National Academy of Sciences , volume =. 2019 , doi =
work page 2019
-
[66]
Journal of the Royal Statistical Society Series B , year=2020, volume=
Carlos Cinelli and Chad Hazlett , title=. Journal of the Royal Statistical Society Series B , year=2020, volume=. doi:10.1111/rssb.12348 , abstract=
-
[67]
Nonparametric Inference for a Family of Counting Processes , urldate =
Odd Aalen , journal =. Nonparametric Inference for a Family of Counting Processes , urldate =
-
[68]
Wayne Nelson , title =. Technometrics , volume =. 1972 , publisher =. doi:10.1080/00401706.1972.10488991 , URL =
-
[69]
Journal of applied mechanics , year=
A statistical distribution function of wide applicability , author=. Journal of applied mechanics , year=
-
[70]
Journal of the Royal Statistical Society Series C: Applied Statistics , volume=
Log-logistic regression models for survival data , author=. Journal of the Royal Statistical Society Series C: Applied Statistics , volume=. 1983 , publisher=
work page 1983
-
[71]
Lognormal distributions , author=. Nature , volume=. 1945 , publisher=
work page 1945
-
[72]
B. Epstein and M. Sobel , journal =. Some Theorems Relevant to Life Testing from an Exponential Distribution , urldate =
-
[73]
Proximal Survival Analysis to Handle Dependent Right Censoring , author=. 2022 , url=
work page 2022
-
[74]
Mansukhani, Raoul and Frimley, Lauren and Shakur-Still, Haleema and Sharples, Linda and Roberts, Ian , year =. Accuracy of time to treatment estimates in the CRASH-3 clinical trial: impact on the trial results , volume =. Trials , doi =
-
[75]
Statistics in Medicine , volume =
Bender, Ralf and Augustin, Thomas and Blettner, Maria , title =. Statistics in Medicine , volume =. doi:10.1002/sim.2059 , url =. https://onlinelibrary.wiley.com/doi/pdf/10.1002/sim.2059 , abstract =
-
[76]
Probability theory and related fields , volume=
Two-sided bias bound of the Kaplan-Meier estimator , author=. Probability theory and related fields , volume=. 1988 , publisher=
work page 1988
-
[77]
Emerging Themes in Epidemiology , volume =
On the collapsibility of measures of effect in the counterfactual causal framework , author =. Emerging Themes in Epidemiology , volume =. 2019 , month =
work page 2019
-
[78]
Journal of the American Statistical Association , volume=
Estimation of Regression Coefficients When Some Regressors are not Always Observed , author=. Journal of the American Statistical Association , volume=. 1994 , publisher=
work page 1994
-
[79]
Statistical Science , volume =
Confounding and collapsibility in causal inference , author =. Statistical Science , volume =. 1999 , month =
work page 1999
-
[80]
Altman, Douglas G and Bland, J Martin , title =. 1998 , doi =. https://www.bmj.com/content/317/7156/468.1.full.pdf , journal =
work page 1998
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.