Topological structure of radiation-induced DNA damage encodes coupled LET-oxygen signatures
Pith reviewed 2026-05-19 19:46 UTC · model grok-4.3
The pith
The topology of DNA double-strand breaks encodes the identity of the radiation particle, its position in the beam, and the local oxygen tension.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
DSB topology encodes particle identity, Spread-Out Bragg Peak position, and oxygen tension in a three-tier hierarchy. Particle identity and SOBP position are exactly decodable with balanced accuracy of 1.000. Oxygen-level classification degrades monotonically with LET from 0.517 for electrons to 0.189 for carbon distal SOBP, with a charge-driven non-monotonicity at the helium-to-carbon transition. The joint 49-class task achieves balanced accuracy 0.346. Per-class recall peaks at 0.5% O2, consistent with the OER curve inflection. Topological summaries dominate oxygen encoding.
What carries the argument
Persistent homology features extracted from the three-dimensional configuration of DSBs, including persistent entropy and landscape integrals, which are then used in Random Forest classification to decode the radiation conditions.
Load-bearing premise
The Voxel-Aware Oxygen model and TOPAS-nBio simulations generate DSB topologies whose persistent homology features accurately mirror real biological damage mechanisms for the tested ranges of LET and oxygen.
What would settle it
Laboratory experiments measuring actual DSB spatial distributions in irradiated cells under controlled particle beams and oxygen tensions that fail to reproduce the reported classification accuracies and monotonic degradation with LET.
Figures
read the original abstract
We present the first nuclear-scale persistent homology and Random Forest classification analysis of radiation-induced DNA double-strand break (DSB) topology across the clinical particle therapy range. Using TOPAS-nBio and the Voxel-Aware Oxygen model, we generated 2,450 simulated nuclei across 49 conditions (seven particle configurations, 0.2--70.7~keV/\textmu{}m; seven oxygen levels, 0.005--21\%~O$_2$) and extracted a 107-feature matrix across seven modalities. DSB topology encodes particle identity, Spread-Out Bragg Peak (SOBP) position, and oxygen tension in a three-tier hierarchy, with fidelity at each tier governed by the physical mechanism controlling it. Particle identity and SOBP position are exactly decodable (balanced accuracy = 1.000). Oxygen-level classification degrades monotonically with LET from 0.517 (electrons) to 0.189 (carbon distal SOBP), with a charge-driven non-monotonicity at the helium-to-carbon transition confirming that atomic number, not LET alone, governs topological discriminability. The joint 49-class task achieves balanced accuracy 0.346, seventeen times above chance. Per-class recall peaks universally at 0.5\%~O$_2$ (0.788--0.976 across all configurations), which is consistent with the OER curve inflection. Topological Summaries (persistent entropy, landscape integrals) dominate oxygen encoding at all LET ($\eta^2_{O_2} =\,$0.300--0.622). A partial-out test reveals two mechanistically separable channels: a count-mediated scale signal ($\eta^2_{O_2}$ survival ratio 0.062) and a count-independent shape signal preserved or enhanced in five of seven configurations (balanced accuracy survival ratio 1.011). Persistent entropy and landscape integrals, as novel radiobiological observables, provide a computational basis for characterizing oxygen-dependent damage topology in hypoxic tumor treatment planning.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a computational study applying persistent homology to extract topological features from DNA double-strand break (DSB) configurations generated by TOPAS-nBio Monte Carlo simulations that incorporate a Voxel-Aware Oxygen model. Across 2,450 simulated nuclei spanning seven particle/LET configurations (0.2–70.7 keV/μm) and seven oxygen levels (0.005–21 % O₂), a 107-feature matrix is fed to Random Forest classifiers. The central claims are that DSB topology encodes particle identity, SOBP position, and oxygen tension in a three-tier hierarchy, with balanced accuracy exactly 1.000 for particle identity and SOBP position, oxygen classification accuracy degrading monotonically with LET (0.517 for electrons to 0.189 for carbon distal SOBP), a charge-driven non-monotonicity at the helium–carbon transition, and the existence of separable count-mediated and count-independent shape signals.
Significance. If the simulation faithfully captures biological DSB geometry, the work introduces persistent entropy and landscape integrals as novel radiobiological observables that could help quantify LET–oxygen coupling in hypoxic tumor regions during particle therapy planning. The scale of the simulation campaign (2,450 nuclei, 49 conditions) and the separation of count-dependent versus shape-only contributions are strengths that would, once experimentally anchored, provide a falsifiable computational framework for testing damage-topology hypotheses.
major comments (2)
- [Methods (Voxel-Aware Oxygen model)] Methods section describing the Voxel-Aware Oxygen model: the model’s oxygen-diffusion and radical-scavenging rules are presented without any direct comparison to experimental measurements of DSB spatial clustering or track-core versus penumbra geometries across the tested LET range. Because the reported perfect classification accuracies, the monotonic oxygen degradation, and the helium–carbon non-monotonicity are all downstream of these simulated topologies, the absence of such validation leaves open the possibility that the three-tier hierarchy reflects model-specific artifacts rather than physical or biological signatures.
- [Results (classification performance)] Results section on classification performance: the balanced accuracy of 1.000 for particle identity and SOBP position is stated without accompanying standard deviations, cross-validation folds, or confusion matrices. Given that the 107-feature matrix is derived from a finite set of 2,450 nuclei and that Random Forest can overfit high-dimensional topological summaries, these details are required to establish that the claimed exact decodability is robust rather than an artifact of the particular train–test split.
minor comments (2)
- [Abstract] Abstract: the phrase “seventeen times above chance” for the 49-class task should be accompanied by the explicit chance level (1/49) for immediate clarity.
- [Methods (feature extraction)] The 107-feature matrix composition is described only at a high level; an explicit breakdown of how many features come from each of the seven modalities (e.g., persistent entropy versus landscape integrals) would aid reproducibility.
Simulated Author's Rebuttal
We thank the referee for the thoughtful and constructive comments, which highlight important aspects of model validation and statistical reporting. We address each major comment below and will incorporate revisions to strengthen the manuscript.
read point-by-point responses
-
Referee: [Methods (Voxel-Aware Oxygen model)] Methods section describing the Voxel-Aware Oxygen model: the model’s oxygen-diffusion and radical-scavenging rules are presented without any direct comparison to experimental measurements of DSB spatial clustering or track-core versus penumbra geometries across the tested LET range. Because the reported perfect classification accuracies, the monotonic oxygen degradation, and the helium–carbon non-monotonicity are all downstream of these simulated topologies, the absence of such validation leaves open the possibility that the three-tier hierarchy reflects model-specific artifacts rather than physical or biological signatures.
Authors: We agree that direct experimental anchoring of the simulated DSB geometries would strengthen interpretation of the topological signatures. The Voxel-Aware Oxygen model extends established radical-diffusion and scavenging frameworks already implemented in TOPAS-nBio; these components have been benchmarked against measured oxygen enhancement ratios and track-structure data in prior literature. However, the manuscript does not include new side-by-side comparisons of simulated versus measured DSB spatial clustering or core-versus-penumbra distributions across the full LET range. In the revised version we will add a dedicated paragraph in the Methods and a limitations subsection in the Discussion that (i) cites the relevant experimental benchmarks for the underlying oxygen model, (ii) explicitly states that the reported three-tier hierarchy remains a computational prediction pending experimental validation, and (iii) outlines how future measurements of DSB topology could test the predicted LET–oxygen coupling. revision: yes
-
Referee: [Results (classification performance)] Results section on classification performance: the balanced accuracy of 1.000 for particle identity and SOBP position is stated without accompanying standard deviations, cross-validation folds, or confusion matrices. Given that the 107-feature matrix is derived from a finite set of 2,450 nuclei and that Random Forest can overfit high-dimensional topological summaries, these details are required to establish that the claimed exact decodability is robust rather than an artifact of the particular train–test split.
Authors: We acknowledge that reporting only the point estimate of balanced accuracy = 1.000 without variability measures leaves the robustness open to question. The classification was performed with Random Forest on the 107-feature matrix derived from the 2,450 nuclei. In the revised manuscript we will expand the Results section to report (i) 5-fold cross-validation results with mean balanced accuracy and standard deviation across folds, (ii) the exact train–test split ratios and random seeds used, and (iii) the full confusion matrices for both the particle-identity and SOBP-position tasks. These additions will demonstrate that the perfect decodability is reproducible across partitions and not an artifact of a single split. revision: yes
Circularity Check
No significant circularity detected in derivation chain
full rationale
The paper's core derivation proceeds from Monte Carlo generation of DSB topologies (TOPAS-nBio + Voxel-Aware Oxygen model) to extraction of 107 persistent-homology features across seven modalities, followed by standard Random Forest classification on the resulting feature matrix. Reported balanced accuracies (1.000 for particle identity/SOBP position, monotonic degradation for oxygen, joint 49-class accuracy 0.346) are direct empirical outputs of this pipeline applied to independently simulated data; no equations, fitted parameters, or self-citations reduce these quantities to tautological restatements of the inputs. The three-tier hierarchy and partial-out tests (count-mediated vs. count-independent signals) are likewise downstream statistical observations rather than definitional identities. The analysis is therefore self-contained against external benchmarks and receives the default non-circularity finding.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The Voxel-Aware Oxygen model accurately represents oxygen effects on DSB topology across 0.005--21% O2.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We present the first nuclear-scale persistent homology and Random Forest classification analysis of radiation-induced DNA double-strand break (DSB) topology... extracted a 107-feature matrix across seven modalities... m7 Topological Summaries (persistent entropy, landscape integrals)
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
- [1]
-
[2]
Sokol, Olga and Durante, Marco , title =. Cancers , volume =. 2023 , doi =
work page 2023
-
[3]
Tinganelli, W. and Durante, M. and Hirayama, R. and Kr. Kill-painting of hypoxic tumours in charged particle therapy , journal =. 2015 , doi =
work page 2015
-
[4]
Bassler, Niels and Toftegaard, Jakob and L\"uhr, Armin and S. Acta Oncologica , volume =. 2014 , doi =
work page 2014
-
[5]
Alper, Tikvah and Howard-Flanders, Paul , title =. Nature , volume =. 1956 , doi =
work page 1956
-
[6]
Furusawa, Yoshiya and Fukutsu, Koichi and Aoki, Masami and Itsukaichi, Hiroshi and Eguchi-Kasai, Kiyomi and Ohara, Hitoshi and Yatagai, Fumio and Kanai, Tatsuaki and Ando, Koichi , title =. Radiation Research , volume =. 2000 , doi =
work page 2000
-
[7]
Grimes, D. Robert and Partridge, M. , title =. Biomedical Physics. 2015 , doi =
work page 2015
-
[8]
Scifoni, E. and Tinganelli, W. and Weyrather, W. K. and Durante, M. and Maier, A. and Kr. Including oxygen enhancement ratio in ion beam treatment planning:. Physics in Medicine. 2013 , doi =
work page 2013
-
[9]
Strigari, L. and Torriani, F. and Manganaro, L. and Inaniwa, T. and Dalmasso, F. and Cirio, R. and Attili, A. , title =. Physics in Medicine and Biology , year =
-
[10]
Chapman, J. D. , title =. New England Journal of Medicine , volume =. 1979 , doi =
work page 1979
-
[11]
Huang, Y. W. and Pan, C. Y. and Hsiao, Y. Y. and Chao, T. C. and Lee, C. C. and Tung, C. J. , title =. Physics in Medicine and Biology , volume =. 2015 , doi =
work page 2015
- [12]
-
[13]
Discrete & Computational Geometry , volume =
Zomorodian, Afra and Carlsson, Gunnar , title =. Discrete & Computational Geometry , volume =. 2005 , doi =
work page 2005
-
[14]
Journal of Applied and Computational Topology , volume =
Bauer, Ulrich , title =. Journal of Applied and Computational Topology , volume =. 2021 , doi =
work page 2021
-
[15]
Journal of Open Source Software , volume =
Tralie, Christopher and Saul, Nathaniel and Bar-On, Rann , title =. Journal of Open Source Software , volume =. 2018 , doi =
work page 2018
-
[16]
Journal of Machine Learning Research , volume =
Adams, Henry and Chepushtanova, Sofya and Emerson, Tegan and Hanson, Eric and Kirby, Michael and Motta, Francis and Neville, Rachel and Peterson, Chris and Shipman, Patrick and Ziegelmeier, Lori , title =. Journal of Machine Learning Research , volume =. 2017 , url =
work page 2017
-
[17]
Thomas, A. M. , title =. Journal of Applied and Computational Topology , volume =. 2025 , doi =
work page 2025
- [18]
-
[19]
and Hausmann, Michael , title =
Hofmann, Alexander and Krufczik, Matthias and Heermann, Dieter W. and Hausmann, Michael , title =. International Journal of Molecular Sciences , volume =. 2018 , doi =
work page 2018
-
[20]
Single molecule localization microscopy analyses of
Hausmann, Michael and Neitzel, Charlotte and Bobkova, Elizaveta and Nagel, David and Hofmann, Andreas and Chramko, Tatyana and Smirnova, Elena and Kope. Single molecule localization microscopy analyses of. Frontiers in Physics , volume =. 2020 , doi =
work page 2020
-
[21]
Hannes Hahn and Charlotte Neitzel and Olga Kope. Topological analysis of. Cancers , volume =. 2021 , doi =
work page 2021
- [22]
-
[23]
Nano-architecture of persistent focal
Scherthan, Heidemarie and Geiger, Beate and Ridinger, Daniel and M. Nano-architecture of persistent focal. Biomolecules , volume =. 2023 , doi =
work page 2023
-
[24]
Weidner, Jonas and Neitzel, Charlotte and Gote, Martin and Deck, Jeanette and K. Advanced image-free analysis of the nano-organization of chromatin and other biomolecules by single molecule localization microscopy (. Computational and Structural Biotechnology Journal , volume =. 2023 , doi =
work page 2023
-
[25]
T. Friedrich and K. Ilicic and C. Greubel and S. Girst and J. Reindl and M. Sammer and B. Schwarz and C. Siebenwirth and D. W. M. Walsh and T. E. Schmid and M. Scholz and G. Dollinger , title =. Scientific Reports , volume =. 2018 , doi =
work page 2018
-
[26]
Hu, Ankang and Zhou, Wanyi and Luo, Xiyu and Qiu, Rui and Li, Junli , title =. Radiation Research , year =
-
[27]
McNamara, Aidan and Geng, Chen and Turner, Ross and Mendez, Jose R. and Perl, Joseph and Held, Kathryn and Faddegon, Bruce and Paganetti, Harald and Schuemann, Jan , title =. Physica Medica , volume =. 2017 , doi =
work page 2017
-
[28]
Schuemann, J. and McNamara, A. L. and Ramos-Méndez, J. and Perl, J. and Held, K. D. and Paganetti, H. and others , title =. Radiation Research , year =
-
[29]
Schuemann, J. and McNamara, A. L. and Warmenhoven, J. W. and Henthorn, N. T. and Kirkby, K. J. and Merchant, M. J. and Ingram, S. and Paganetti, H. and Held, K. D. and Ramos-Mendez, J. and Faddegon, B. and Perl, J. and Goodhead, D. T. and Plante, I. and Rabus, H. and Nettelbeck, H. and Friedland, W. and Kundr. A new standard. Radiation Research , year =
- [30]
-
[31]
Hirayama, Ryoichi and Ito, Atsushi and Tomita, Masanori and Tsukada, Teruyo and Yatagai, Fumio and Noguchi, Miho and Matsumoto, Yoshitaka and Kase, Yuki and Ando, Koichi and Okayasu, Ryuichi and Furusawa, Yoshiya , title =. Radiation Research , volume =. 2009 , doi =
work page 2009
-
[32]
Breiman, Leo , title =. Machine Learning , volume =. 2001 , doi =
work page 2001
- [33]
-
[34]
Ambrosio, Susanna and Noviello, Anna and Di Fusco, Giovanni and Gorini, Francesca and Piscone, Anna and Amente, Stefano and Majello, Barbara , title =. Cancers , volume =. 2025 , doi =
work page 2025
- [35]
- [36]
-
[37]
International Journal of Molecular Sciences , volume =
Schäfer, Myriam and Hildenbrand, Georg and Hausmann, Michael , title =. International Journal of Molecular Sciences , volume =. 2024 , doi =
work page 2024
-
[38]
Pattern Recognition , volume =
Nieves Atienza and Rocio Gonzalez-Díaz and Manuel Soriano-Trigueros , title =. Pattern Recognition , volume =. 2020 , doi =
work page 2020
-
[39]
The Journal of Machine Learning Research , volume =
Bubenik, Peter , title =. The Journal of Machine Learning Research , volume =. 2015 , url =
work page 2015
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.