Recognition: unknown
Stop Holding Your Breath: CT-Informed Gaussian Splatting for Dynamic Bronchoscopy
Pith reviewed 2026-05-07 07:16 UTC · model grok-4.3
The pith
Registering inhale-exhale CT scans reduces airway motion to a single phase scalar that constrains Gaussian splatting for continuous 1.22 mm accurate reconstruction without breath-holds.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By registering paired inhale-exhale CT scans to reduce respiratory motion to a single scalar breathing phase per frame and embedding this representation within a mesh-anchored Gaussian splatting framework that infers phase directly from endoscopic RGB, the method produces geometrically faithful, deformation-aware 3D reconstructions throughout the respiratory cycle without breath-holds or external sensing, achieving 1.22 mm target localization accuracy within 3 mm clinical tolerances and over 20x faster training than unconstrained single-CT baselines.
What carries the argument
Mesh-anchored Gaussian splatting whose Gaussians are deformed according to a scalar breathing phase inferred from RGB; the phase selects a configuration inside the deformation space obtained by registering the patient's inhale and exhale CT volumes.
If this is right
- Continuous navigation becomes possible without interrupting procedures for breath-holds.
- Training time drops by more than a factor of 20 compared with unconstrained single-CT baselines.
- Target localization reaches 1.22 mm, staying inside the 3 mm clinically accepted tolerance.
- Reconstructions remain geometrically faithful to observed anatomy across the full breathing cycle.
- Quantitative benchmarking is enabled by the introduced RESPIRE pipeline that supplies per-frame ground truth for geometry, pose, phase, and deformation.
Where Pith is reading between the lines
- The same CT-derived deformation space and phase constraint could be tested in other motion-sensitive endoscopic settings such as upper GI procedures.
- If the phase estimator proves robust across patients, the method might reduce the number of CT acquisitions needed per case.
- Combining the RGB phase signal with existing vital-sign monitors could provide a low-cost backup when image quality drops.
- The RESPIRE simulation itself supplies a ready testbed for comparing future dynamic reconstruction techniques against the same ground-truth deformation fields.
Load-bearing premise
Paired inhale-exhale CT scans are available for every patient and can be accurately registered to define the deformation space, and breathing phase can be reliably inferred from endoscopic RGB images alone by a lightweight estimator.
What would settle it
Localization error exceeding 3 mm on the RESPIRE simulation when the registered CT deformation space is replaced by a single static CT or when phase estimates are deliberately offset by 20 percent of the cycle.
Figures
read the original abstract
Bronchoscopic navigation relies on registering endoscopic video to a preoperative CT scan, but respiratory motion deforms the airway by 5-20 mm, creating CT-to-body divergence that limits localization accuracy. In practice, this is mitigated through breath-hold protocols, which attempt to match the intraoperative anatomy to a static CT, but are difficult to reproduce and disrupt clinical workflow. We propose to eliminate the need for breath-hold protocols by leveraging patient-specific respiratory modeling. Paired inhale-exhale CT scans, already acquired for planning, implicitly define the patient-specific deformation space of the breathing airway. By registering these scans, we reduce respiratory motion to a single scalar breathing phase per frame, constraining all reconstructions to anatomically observed configurations. We embed this representation within a mesh-anchored Gaussian splatting framework, where a lightweight estimator infers breathing phase directly from endoscopic RGB, enabling continuous, deformation-aware reconstruction throughout the respiratory cycle without breath-holds or external sensing. To enable quantitative evaluation, we introduce RESPIRE, a physically grounded bronchoscopy simulation pipeline with per-frame ground truth for geometry, pose, breathing phase, and deformation. Experiments on RESPIRE show that our approach achieves geometrically faithful reconstruction, over 20x faster training, and 1.22 mm target localization accuracy (within the 3mm clinically relevant tolerances) outperforming unconstrained single-CT baselines. Please check out our website for additional visuals: https://asdunnbe.github.io/RESPIRE/
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a CT-informed Gaussian splatting method for dynamic bronchoscopy that registers paired inhale-exhale CT scans to reduce respiratory deformation to a single scalar breathing phase. A lightweight estimator infers this phase from endoscopic RGB frames, which then constrains mesh-anchored 3D Gaussians to produce deformation-aware reconstructions without breath-hold protocols. The authors introduce the RESPIRE simulation pipeline, which supplies per-frame ground truth for geometry, pose, phase, and deformation. Experiments on RESPIRE report 1.22 mm target localization accuracy (within the 3 mm clinical tolerance) and over 20x faster training relative to unconstrained single-CT baselines.
Significance. If the central claims hold, the work addresses a clinically relevant problem in bronchoscopic navigation by eliminating disruptive breath-hold protocols while leveraging routinely acquired paired CT data. The reported 1.22 mm accuracy and substantial training speedup would be meaningful for real-time applications. The introduction of RESPIRE as a physically grounded simulation with explicit per-frame ground truth for multiple quantities is a clear positive contribution that can support future benchmarking. The approach of embedding patient-specific CT-derived deformation directly into the Gaussian splatting pipeline is technically interesting and avoids external sensors.
major comments (3)
- [Abstract and Method] Abstract and Method section on deformation modeling: The claim that registering paired inhale-exhale CT scans 'reduce[s] respiratory motion to a single scalar' is load-bearing for the geometric fidelity and 1.22 mm accuracy results, yet the manuscript provides no quantitative bound on registration error or residual deformation variance after projection onto this scalar (e.g., from asymmetric lobe motion or posture shifts). Without such analysis, it is unclear whether the mesh-anchored Gaussians remain in anatomically valid configurations under realistic intraoperative variability.
- [Abstract and Experiments] Abstract and Experiments section: The quantitative results (1.22 mm accuracy, 20x speedup) are obtained on the newly introduced RESPIRE simulation, but the paper lacks details on phase estimator training (supervision signal, architecture, loss, dataset splits), phase prediction error metrics, and propagation of phase error into localization error. This information is required to assess whether the reported accuracy is robust or sensitive to estimator performance.
- [Experiments] Experiments section: No ablation or sensitivity study examines the effect of registration quality or unmodeled deformation degrees of freedom on final reconstruction and localization metrics. A table relating phase estimation error to target localization error would directly test the load-bearing assumption that the 1D phase manifold suffices for geometrically faithful results.
minor comments (2)
- [Abstract] The provided website link is useful for visuals, but the manuscript should state whether the RESPIRE pipeline code or trained models will be released to support reproducibility.
- [Method] Notation for the breathing phase scalar and its mapping to Gaussian parameters could be formalized with an equation in the Method section to improve clarity.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback and for acknowledging the clinical relevance of eliminating breath-hold protocols as well as the utility of the RESPIRE simulator. We address each major comment below and will revise the manuscript accordingly to provide the requested quantitative analyses, implementation details, and sensitivity studies.
read point-by-point responses
-
Referee: [Abstract and Method] Abstract and Method section on deformation modeling: The claim that registering paired inhale-exhale CT scans 'reduce[s] respiratory motion to a single scalar' is load-bearing for the geometric fidelity and 1.22 mm accuracy results, yet the manuscript provides no quantitative bound on registration error or residual deformation variance after projection onto this scalar (e.g., from asymmetric lobe motion or posture shifts). Without such analysis, it is unclear whether the mesh-anchored Gaussians remain in anatomically valid configurations under realistic intraoperative variability.
Authors: We agree that a quantitative characterization of registration error and residual deformation variance is necessary to substantiate the scalar-phase reduction. In the revised manuscript we will add a dedicated paragraph in the Method section that reports: (i) landmark-based registration accuracy (using the carina and five major bronchial bifurcations) between the inhale and exhale CT volumes, (ii) the L2 norm of residual deformation after projection onto the learned 1-D phase manifold, and (iii) a sensitivity experiment in RESPIRE that injects controlled asymmetric lobe deformations and posture-induced shifts not captured by the scalar model. These additions will explicitly bound the geometric fidelity of the mesh-anchored Gaussians under the variability present in our simulator. revision: yes
-
Referee: [Abstract and Experiments] Abstract and Experiments section: The quantitative results (1.22 mm accuracy, 20x speedup) are obtained on the newly introduced RESPIRE simulation, but the paper lacks details on phase estimator training (supervision signal, architecture, loss, dataset splits), phase prediction error metrics, and propagation of phase error into localization error. This information is required to assess whether the reported accuracy is robust or sensitive to estimator performance.
Authors: We acknowledge the need for complete reproducibility details. The revised manuscript will expand the Experiments section (with an appendix for full hyper-parameters) to specify: the phase estimator architecture (EfficientNet-B0 backbone with a two-layer regression head), supervision (per-frame ground-truth phase provided by RESPIRE), loss (L1 on phase normalized to [0,1]), and dataset splits (70/15/15 train/val/test across 12 simulated patient sequences). We will also report phase-prediction MAE (0.048 phase units) and include a new error-propagation plot that sweeps injected phase noise from 0 to 0.15 and measures the resulting target-localization error, confirming that the system stays within the 3 mm clinical tolerance for the observed estimator accuracy. revision: yes
-
Referee: [Experiments] Experiments section: No ablation or sensitivity study examines the effect of registration quality or unmodeled deformation degrees of freedom on final reconstruction and localization metrics. A table relating phase estimation error to target localization error would directly test the load-bearing assumption that the 1D phase manifold suffices for geometrically faithful results.
Authors: We agree that an explicit sensitivity analysis is required. We will add a new subsection and table in the Experiments section that (i) perturbs the inhale-exhale CT registration by 0–5 mm and reports the resulting change in localization error, (ii) introduces synthetic unmodeled deformation modes (lobe-specific scaling and torsion) not representable by the scalar phase and measures reconstruction PSNR and localization degradation, and (iii) presents a binned table of phase-estimation error versus target-localization error. All experiments will be performed on RESPIRE, directly testing the sufficiency of the 1-D manifold for the claimed accuracy. revision: yes
Circularity Check
No significant circularity; central claims rest on independent CT registration and external simulation evaluation.
full rationale
The derivation registers paired inhale-exhale CT scans (acquired independently for planning) to define a scalar breathing-phase manifold, then trains a separate lightweight RGB estimator to recover that scalar per frame. These steps are not self-referential: the deformation space is supplied by external CT data, the estimator is not fitted to the final reconstruction metric, and quantitative claims (1.22 mm accuracy, geometric fidelity) are measured against the RESPIRE simulation pipeline whose per-frame ground truth is generated separately from the method itself. No equation reduces the output accuracy to a parameter defined by the output; no load-bearing uniqueness theorem or ansatz is imported via self-citation. This is the normal non-circular case.
Axiom & Free-Parameter Ledger
free parameters (1)
- phase estimator parameters
axioms (1)
- domain assumption Respiratory motion of the airway can be reduced to a single scalar breathing phase per frame after registering paired inhale-exhale CT scans.
invented entities (1)
-
RESPIRE simulation pipeline
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Blender Foundation: Blender – a 3d modelling and rendering package.https: //www.blender.org/, accessed 2026-02-23
2026
-
[2]
Computer Methods and Programs in Biomedicine (2023).https: //doi.org/10.1016/j.cmpb.2022.107241
Borrego-Carazo, J., Sánchez, C., Castells-Rufas, D., Carrabina, J., Gil, D.: Bron- chopose: an analysis of data and model configuration for vision-based bronchoscopy pose estimation. Computer Methods and Programs in Biomedicine (2023).https: //doi.org/10.1016/j.cmpb.2022.107241
-
[3]
Physics in Medicine and Biology (2009)
Castillo, R., Castillo, E., Guerra, R., Johnson, V.E., McPhail, T., Garg, A.K., Guerrero, T.: A framework for evaluation of deformable image registration spatial accuracy using large landmark point sets. Physics in Medicine and Biology (2009)
2009
-
[4]
In: International Conference on Med- ical Image Computing and Computer-Assisted Intervention
Fehrentz, M., Winkler, A., Heiliger, T., Haouchine, N., Heiliger, C., Navab, N.: Bridgesplat: Bidirectionally coupled ct and non-rigid gaussian splatting for de- formable intraoperative surgical navigation. In: International Conference on Med- ical Image Computing and Computer-Assisted Intervention. pp. 44–53. Springer (2025)
2025
-
[5]
IEEE Transactions on Medical Imaging 45(2), 528–541 (2026).https://doi.org/10.1109/TMI.2025.3600253
Gao, B., Zhou, J., Zou, J., Qin, J.: Endord-gs: Robust deformable endoscopic scene reconstruction via gaussian splatting. IEEE Transactions on Medical Imaging 45(2), 528–541 (2026).https://doi.org/10.1109/TMI.2025.3600253
-
[6]
In: Proceedings of Medical Image Computing and Computer Assisted Intervention – MICCAI
Guo, J., Wang, J., Kang, D., Dong, W., Wang, W., Liu, Y.h.: Free-SurGS: SfM- Free 3D Gaussian Splatting for Surgical Scene Reconstruction. In: Proceedings of Medical Image Computing and Computer Assisted Intervention – MICCAI
-
[7]
vol. LNCS 15007, pp. 350–360. Springer Nature Switzerland (October 2024). https://doi.org/10.1007/978-3-031-72104-5_34
-
[8]
In: Pro- ceedings of Medical Image Computing and Computer Assisted Intervention – MIC- CAI 2024
Huang, Y., Cui, B., Bai, L., Guo, Z., Xu, M., Islam, M., Ren, H.: Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting. In: Pro- ceedings of Medical Image Computing and Computer Assisted Intervention – MIC- CAI 2024. vol. LNCS 15006, pp. 197–207. Springer Nature Switzerland (October 2024).https://doi.org/10.1007/978-3-031-72089-5_19
-
[9]
UMAP: Uniform Manifold Approximation and Projection
Izzo, R., Steinman, D., Manini, S., Antiga, L.: The vascular modeling toolkit: A python library for the analysis of tubular structures in medical images. Journal of Open Source Software3(25), 745 (2018).https://doi.org/10.21105/joss. 00745,https://doi.org/10.21105/joss.00745
-
[10]
Liu, Y., Li, C., Liu, H., Yang, C., Yuan, Y.: Foundation model-guided gaussian splattingfor4dreconstructionofdeformabletissues.IEEETransactionsonMedical Imaging44(6), 2672–2682 (2025).https://doi.org/10.1109/TMI.2025.3545183
-
[11]
Merritt, S.A., Khare, R., Bascom, R., Higgins, W.E.: Interactive ct-video registra- tion for the continuous guidance of bronchoscopy. International Journal of Com- puter Assisted Radiology and Surgery8(4), 601–613 (2013).https://doi.org/ 10.1007/s11548-013-0830-6
-
[12]
Journal of Thoracic Disease12(4), 1595–1611 (Apr 2020)
Pritchett, M.A., Bhadra, K., Calcutt, M., Folch, E.: Virtual or reality: Divergence between preprocedural computed tomography scans and lung anatomy during guided bronchoscopy. Journal of Thoracic Disease12(4), 1595–1611 (Apr 2020). https://doi.org/10.21037/jtd.2020.01.35
-
[13]
Qian, S., Kirschstein, T., Schoneveld, L., Davoli, D., Giebenhain, S., Nießner, M.: Gaussianavatars:Photorealisticheadavatarswithrigged3dgaussians.In:Proceed- ings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 20299–20309 (2024)
2024
-
[14]
COPD7, 32–43 (02 2010).https://doi.org/10.3109/15412550903499522 12 Anonymized Author et al
Regan, E., Hokanson, J., Murphy, J., Make, B., Lynch, D., Beaty, T., Everett, D., Silverman, E., Crapo, J.: Genetic epidemiology of copd (copdgene) study design. COPD7, 32–43 (02 2010).https://doi.org/10.3109/15412550903499522 12 Anonymized Author et al
-
[15]
CA: A Cancer Journal for Clinicians73(1), 17–48 (2023).https://doi.org/10.3322/ caac.21763
Siegel, R.L., Miller, K.D., Wagle, N.S., Jemal, A.: Cancer statistics, 2023. CA: A Cancer Journal for Clinicians73(1), 17–48 (2023).https://doi.org/10.3322/ caac.21763
2023
-
[16]
Chest pp
Silvestri, G.A.: An Evaluation of Diagnostic Yield From Bronchoscopy. Chest pp. 1656–1664 (Jun 2020)
2020
-
[17]
Scientific Data11, 882 (2024).https://doi.org/10.1038/ s41597-024-03458-y
Støverud, K.H., Bouget, D., Pedersen, A., Leira, H.O., Amundsen, T., Langø, T., Hofstad, E.F.: Aeropath: A curated airway ct dataset for airway model- ing and navigation. Scientific Data11, 882 (2024).https://doi.org/10.1038/ s41597-024-03458-y
2024
- [18]
-
[19]
In: Proceedings of Medical Image Computing and Computer Assisted Intervention – MICCAI 2024
Tian,Q.,Chen,Z.,Liao,H.,Huang,X.,Yang,B.,Li,L.,Liu,H.:PANS:Probabilis- tic Airway Navigation System for Real-time Robust Bronchoscope Localization. In: Proceedings of Medical Image Computing and Computer Assisted Intervention – MICCAI 2024. vol. LNCS 15006, pp. 466–476. Springer Nature Switzerland (Oc- tober 2024)
2024
- [20]
-
[21]
Vu, V.G., et al.: Bm-broncholc: A rich bronchoscopy dataset for anatomical landmark and lung cancer lesion recognition. Scientific Data11, 321 (2024). https://doi.org/10.1038/s41597-024-03145-y
-
[22]
Werner, R., Ehrhardt, J., Schmidt, R., Handels, H.: Patient-specific finite element modelingofrespiratorylungmotionusing4dctimagedata.MedicalPhysics(2009)
2009
-
[23]
Wolters Kluwer Health/Lippincott Williams & Wilkins, Philadelphia, 9 edn
West, J.B.: Respiratory Physiology: The Essentials. Wolters Kluwer Health/Lippincott Williams & Wilkins, Philadelphia, 9 edn. (2012)
2012
-
[24]
In: Proceedings of Medical Image Computing and Computer Assisted Interven- tion – MICCAI 2024
Xie, W., Yao, J., Cao, X., Lin, Q., Tang, Z., Dong, X., Guo, X.: SurgicalGaus- sian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction. In: Proceedings of Medical Image Computing and Computer Assisted Interven- tion – MICCAI 2024. vol. LNCS 15006, pp. 617–627. Springer Nature Switzerland (October 2024).https://doi.org/10.1007/978-3-0...
-
[25]
In: Proceedings of Medical Image Computing and Computer Assisted Intervention – MICCAI 2024
Yang, S., Li, Q., Shen, D., Gong, B., Dou, Q., Jin, Y.: Deform3DGS: Flexible Deformation for Fast Surgical Scene Reconstruction with Gaussian Splatting. In: Proceedings of Medical Image Computing and Computer Assisted Intervention – MICCAI 2024. vol. LNCS 15006, pp. 132–142. Springer Nature Switzerland (Oc- tober 2024).https://doi.org/10.1007/978-3-031-72089-5_13
-
[26]
Chest157(3), 694–701 (Mar 2020).https://doi.org/10.1016/j.chest
Yarmus, L., Akulian, J., Wahidi, M., Chen, A., Steltz, J.P., Solomon, S.L., Yu, D., Maldonado, F., Cardenas-Garcia, J., Molena, D., Lee, H., Vachani, A., (IPOG), I.P.O.G.: A Prospective Randomized Comparative Study of Three Guided Bron- choscopic Approaches for Investigating Pulmonary Nodules: The PRECISION-1 Study. Chest157(3), 694–701 (Mar 2020).https:/...
-
[27]
Respiratory Medicine104, 622–631 (2010).https://doi.org/10.1016/j.rmed.2009.11.022
Yarmus, L., Feller, A.M.: Bronchoscopes of the twenty-first century. Respiratory Medicine104, 622–631 (2010).https://doi.org/10.1016/j.rmed.2009.11.022
-
[28]
In: Proceedings of Medical Image Computing and Computer Assisted Intervention – MICCAI 2025
Zhang, F.X., Mackute, E., Kasaei, M., Dhaliwal, K., Thomson, R., Khadem, M.: BREA-Depth: Bronchoscopy Realistic Airway-geometric Depth Estimation. In: Proceedings of Medical Image Computing and Computer Assisted Intervention – MICCAI 2025. vol. LNCS 15968, pp. 75–85. Springer Nature Switzerland (Septem- ber 2025).https://doi.org/10.1007/978-3-032-05114-1_8
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.