pith. machine review for the scientific record. sign in

arxiv: 2604.19736 · v1 · submitted 2026-04-21 · 💻 cs.CV

Recognition: unknown

Generative Drifting for Conditional Medical Image Generation

Andreas Maier, Lina G\"olz, Siyuan Mei, Weiwen Wu, Yan Xia, Zirong Li

Authors on Pith no claims yet

Pith reviewed 2026-05-10 02:17 UTC · model grok-4.3

classification 💻 cs.CV
keywords generative driftingconditional medical image generation3D medical imagingMRI-to-CT synthesissparse-view CT reconstructionattractive-repulsive driftmulti-level feature bank
0
0 comments X

The pith

GDM reformulates conditional 3D medical image generation as a drifting process that jointly optimizes distribution plausibility and patient fidelity in one inference step.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a generative drifting model to solve the core tension in medical image synthesis between fast one-step inference and the need for both realistic overall distributions and accurate patient-specific details. Existing GAN, flow, and diffusion approaches often sacrifice one for the other in high-dimensional 3D volumes. GDM introduces an attractive-repulsive drift that pulls the generated distribution toward the target while using a multi-level feature bank from a foundation encoder to compute stable drift fields. A gradient coordination step further balances the competing objectives in the shared output space. The result is a method that reports better trade-offs on two standard tasks: MRI-to-CT translation and sparse-view CT reconstruction.

Core claim

GDM extends drifting to 3D medical imaging by constructing an attractive-repulsive drift that minimizes discrepancy between the generator pushforward and the target distribution, supported by a multi-level feature bank from a medical foundation encoder for global-local-spatial affinity estimation and by gradient coordination in the shared output space to balance distribution-level and fidelity-oriented objectives. On MRI-to-CT synthesis and sparse-view CT reconstruction, the framework consistently outperforms GAN-based, flow-matching-based, SDE-based generative models, and supervised regression methods while improving the joint balance among anatomical fidelity, quantitative reliability, and

What carries the argument

Attractive-repulsive drift supported by a multi-level feature bank extracted from a medical foundation encoder, which supplies reliable affinity estimates and drifting fields across complementary global, local, and spatial representations.

If this is right

  • The model retains single-step inference while still promoting both distribution-level plausibility and patient-specific fidelity.
  • Gradient coordination in the shared output space improves optimization stability when distribution and fidelity objectives compete.
  • Performance gains hold across two representative clinical tasks: MRI-to-CT synthesis and sparse-view CT reconstruction.
  • The framework is compatible with a range of backbone architectures because the drifting mechanism operates on top of any generator that produces volumetric outputs.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same drifting construction could be tested on other paired 3D modalities such as PET-CT or ultrasound-CT without requiring new loss terms.
  • Because the feature bank is drawn from a pretrained foundation encoder, the method may reduce the volume of paired training data needed for new clinical sites.
  • If the drift fields prove interpretable, they could be inspected post hoc to identify which anatomical regions drive the largest corrections during generation.

Load-bearing premise

The multi-level feature bank from the medical foundation encoder supplies reliable affinity estimates and drifting fields that remain stable when learning on 3D volumetric data.

What would settle it

A controlled experiment on a held-out 3D medical synthesis task in which GDM produces lower anatomical fidelity scores or visibly unstable drift fields compared with a strong regression baseline would falsify the central claim.

Figures

Figures reproduced from arXiv: 2604.19736 by Andreas Maier, Lina G\"olz, Siyuan Mei, Weiwen Wu, Yan Xia, Zirong Li.

Figure 1
Figure 1. Figure 1: Illustration of trade-off among inference efficiency, patient-specific [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Overview of the GDM framework. During training, the generator is jointly optimized by image-domain fidelity supervision and drifting-based [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Computation of the distribution drifting field. Positive and negative [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Illustration of the multi-level feature bank used for drifting. Local [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: MRI-to-CT synthesis results. Rows 1–2 show an axial case (WW/WL = 1500/-200 HU), and rows 3–4 show a coronal case (WW/WL = 1400/-500 [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Segmentation on synthesized CT volumes from the SynthRAD2025 [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 8
Figure 8. Figure 8: Residual noise power spectrum (NPS) analysis [35]. The top row corresponds to MRI-to-CT synthesis and the bottom row to SVCT reconstruction. [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: SVCT reconstruction results from 60 views. Rows 1–2 show axial views (WW/WL = 350/40 HU), and rows 3–4 show coronal views (WW/WL = [PITH_FULL_IMAGE:figures/full_fig_p008_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Intensity profiles along the indicated line in the coronal ROI of [PITH_FULL_IMAGE:figures/full_fig_p008_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Ablation of drifting and MGDA on SVCT reconstruction. Without [PITH_FULL_IMAGE:figures/full_fig_p009_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Ablation of drifting temperature τ ∈ {0.004, 0.01, 0.04} on syn￾thesis (top) and reconstruction (bottom). Larger τ yields stronger generative effects and sharper appearance but may increase structural instability, whereas smaller τ produces over-smoothed outputs. TABLE V ABLATION OF DRIFTING TEMPERATURE τ . PARENTHETICAL VALUES DENOTE DIFFERENCES FROM GDM. τ MRI-to-CT Synthesis SVCT Reconstruction MAE↓ Di… view at source ↗
read the original abstract

Conditional medical image generation plays an important role in many clinically relevant imaging tasks. However, existing methods still face a fundamental challenge in balancing inference efficiency, patient-specific fidelity, and distribution-level plausibility, particularly in high-dimensional 3D medical imaging. In this work, we propose GDM, a generative drifting framework that reformulates deterministic medical image prediction as a multi-objective learning problem to jointly promote distribution-level plausibility and patient-specific fidelity while retaining one-step inference. GDM extends drifting to 3D medical imaging through an attractive-repulsive drift that minimizes the discrepancy between the generator pushforward and the target distribution. To enable stable drifting-based learning in 3D volumetric data, GDM constructs a multi-level feature bank from a medical foundation encoder to support reliable affinity estimation and drifting field computation across complementary global, local, and spatial representations. In addition, a gradient coordination strategy in the shared output space improves optimization balance under competing distribution-level and fidelity-oriented objectives. We evaluate the proposed framework on two representative tasks, MRI-to-CT synthesis and sparse-view CT reconstruction. Experimental results show that GDM consistently outperforms a wide range of baselines, including GAN-based, flow-matching-based, and SDE-based generative models, as well as supervised regression methods, while improving the balance among anatomical fidelity, quantitative reliability, perceptual realism, and inference efficiency. These findings suggest that GDM provides a practical and effective framework for conditional 3D medical image generation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The manuscript proposes Generative Drifting Model (GDM), a framework that reformulates conditional 3D medical image generation (e.g., MRI-to-CT synthesis and sparse-view CT reconstruction) as a multi-objective problem. It introduces an attractive-repulsive drift to minimize pushforward-to-target discrepancy, supported by a multi-level feature bank extracted from a medical foundation encoder for affinity estimation across global/local/spatial representations, plus a gradient coordination strategy in shared output space. The work claims one-step inference with consistent outperformance over GAN-, flow-matching-, SDE-based, and supervised regression baselines while improving balance among anatomical fidelity, quantitative reliability, perceptual realism, and efficiency.

Significance. If the reported gains prove robust and reproducible, GDM could meaningfully advance practical conditional generation for high-dimensional medical volumes by combining distribution-level plausibility with patient-specific fidelity under efficient inference. The extension of drifting via foundation-model features is a targeted contribution, but its significance hinges on whether the empirical advantages are supported by detailed metrics, ablations, and stability checks rather than qualitative assertions alone.

major comments (1)
  1. [Methods (multi-level feature bank and drifting field computation)] The central claim of stable 3D drifting and consistent outperformance rests on the multi-level feature bank (global, local, and spatial representations from the medical foundation encoder) producing reliable affinities for the attractive-repulsive drift field. The manuscript provides no ablation studies, sensitivity analysis to imaging artifacts (e.g., intensity inhomogeneity), or quantitative verification that this component outperforms simpler feature extractors; without such evidence the load-bearing assumption remains untested and the performance claims cannot be fully evaluated.
minor comments (2)
  1. [Abstract] The abstract asserts 'consistent outperformance' and 'improved balance' without any numerical metrics, error bars, or dataset sizes; a brief quantitative summary should be added for clarity even though the full results section presumably contains them.
  2. [Methods] Notation for the drifting field, affinity estimation, and gradient coordination should be introduced with explicit equations and variable definitions in the methods to aid reproducibility.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the thoughtful review and for recognizing the potential of GDM to advance conditional 3D medical image generation. The feedback on the multi-level feature bank is valuable and highlights an area where additional evidence can strengthen the manuscript. We address the major comment below and will revise the paper accordingly to improve clarity and empirical support.

read point-by-point responses
  1. Referee: [Methods (multi-level feature bank and drifting field computation)] The central claim of stable 3D drifting and consistent outperformance rests on the multi-level feature bank (global, local, and spatial representations from the medical foundation encoder) producing reliable affinities for the attractive-repulsive drift field. The manuscript provides no ablation studies, sensitivity analysis to imaging artifacts (e.g., intensity inhomogeneity), or quantitative verification that this component outperforms simpler feature extractors; without such evidence the load-bearing assumption remains untested and the performance claims cannot be fully evaluated.

    Authors: We agree that dedicated validation of the multi-level feature bank would strengthen the manuscript. The current version emphasizes the overall framework, end-to-end results on MRI-to-CT and sparse-view CT tasks, and comparisons against GAN, flow-matching, SDE, and regression baselines, but does not include isolated ablations of the feature bank or explicit sensitivity tests to artifacts such as intensity inhomogeneity. In the revised manuscript we will add (i) ablation studies replacing the multi-level bank with simpler single-level or non-foundation encoders while keeping all other components fixed, (ii) quantitative metrics (e.g., affinity correlation, drift-field stability, and downstream generation quality) demonstrating the benefit of the multi-level design, and (iii) sensitivity analyses on subsets of the data exhibiting intensity inhomogeneity and other common artifacts. These additions will be presented with tables and figures to allow direct evaluation of the component's contribution. revision: yes

Circularity Check

0 steps flagged

No circularity: derivation chain is self-contained as an independent extension

full rationale

The abstract and description present GDM as a reformulation of deterministic prediction into multi-objective learning with an attractive-repulsive drift minimizing pushforward-to-target discrepancy, enabled by a multi-level feature bank from a medical foundation encoder plus gradient coordination. No equations, fitted parameters, or self-citations are shown that reduce any claimed result (e.g., outperformance or stability) to quantities defined by the paper's own inputs by construction. The central components are introduced as extensions rather than tautological renamings or self-referential fits, and experimental claims stand apart from the framework definition. This is the normal case of a non-circular methodological proposal.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review yields no explicit free parameters, axioms, or invented entities; the framework relies on standard generative modeling concepts plus the newly combined drifting and feature-bank mechanisms.

pith-pipeline@v0.9.0 · 5569 in / 1251 out tokens · 48520 ms · 2026-05-10T02:17:00.400433+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. One-Step Generative Modeling via Wasserstein Gradient Flows

    cs.LG 2026-05 conditional novelty 7.0

    W-Flow achieves state-of-the-art one-step ImageNet 256x256 generation at 1.29 FID by training a static neural network to follow a Wasserstein gradient flow that minimizes Sinkhorn divergence, delivering roughly 100x f...

Reference graph

Works this paper leans on

38 extracted references · 4 canonical work pages · cited by 1 Pith paper · 3 internal anchors

  1. [1]

    Diffboost: Enhancing medical image segmen- tation via text-guided diffusion model,

    Z. Zhang, L. Yao, B. Wang, D. Jha, G. Durak, E. Keles, A. Medetal- ibeyoglu, and U. Bagci, “Diffboost: Enhancing medical image segmen- tation via text-guided diffusion model,”IEEE Transactions on Medical Imaging, vol. 44, no. 9, pp. 3670–3682, 2024

  2. [2]

    Unsupervised medical image translation with adversarial diffusion models,

    M. ¨Ozbey, O. Dalmaz, S. U. Dar, H. A. Bedel, S ¸. ¨Ozturk, A. G ¨ung¨or, and T. Cukur, “Unsupervised medical image translation with adversarial diffusion models,”IEEE Transactions on Medical Imaging, vol. 42, no. 12, pp. 3524–3539, 2023

  3. [3]

    Cross- conditioned diffusion model for medical image to image translation,

    Z. Xing, S. Yang, S. Chen, T. Ye, Y . Yang, J. Qin, and L. Zhu, “Cross- conditioned diffusion model for medical image to image translation,” in International Conference on Medical Image Computing and Computer- Assisted Intervention. Springer, 2024, pp. 201–211

  4. [4]

    CT-sdm: A sampling diffu- sion model for sparse-view CT reconstruction across various sampling rates,

    L. Yang, J. Huang, G. Yang, and D. Zhang, “CT-sdm: A sampling diffu- sion model for sparse-view CT reconstruction across various sampling rates,”IEEE Transactions on Medical Imaging, 2025. 11

  5. [5]

    Regression is all you need for medical image translation,

    S. Rassmann, D. K ¨ugler, C. Ewert, and M. Reuter, “Regression is all you need for medical image translation,”IEEE Transactions on Medical Imaging, 2026

  6. [6]

    Tomographic foundation Model—FORCE: Flow-oriented reconstruction conditioning engine,

    W. Xia, C. Niu, and G. Wang, “Tomographic foundation Model—FORCE: Flow-oriented reconstruction conditioning engine,” IEEE Transactions on Medical Imaging, 2026

  7. [7]

    A new deep convolutional neural network design with efficient learning capability: Application to CT image synthesis from MRI,

    A. Bahrami, A. Karimian, E. Fatemizadeh, H. Arabi, and H. Zaidi, “A new deep convolutional neural network design with efficient learning capability: Application to CT image synthesis from MRI,”Medical Physics, vol. 47, no. 10, pp. 5158–5171, 2020

  8. [8]

    Low-dose CT with a residual encoder-decoder convolutional neural network,

    H. Chen, Y . Zhang, M. K. Kalra, F. Lin, Y . Chen, P. Liao, J. Zhou, and G. Wang, “Low-dose CT with a residual encoder-decoder convolutional neural network,”IEEE Transactions on Medical Imaging, vol. 36, no. 12, pp. 2524–2535, 2017

  9. [9]

    MRI-only based synthetic CT generation using dense cycle consistent generative adversarial networks,

    Y . Lei, J. Harms, T. Wang, Y . Liu, H.-K. Shu, A. B. Jani, W. J. Curran, H. Mao, T. Liu, and X. Yang, “MRI-only based synthetic CT generation using dense cycle consistent generative adversarial networks,”Medical Physics, vol. 46, no. 8, pp. 3565–3581, 2019

  10. [10]

    Yang Song, Liyue Shen, Lei Xing, and Stefano Ermon

    Y . Song, L. Shen, L. Xing, and S. Ermon, “Solving inverse problems in medical imaging with score-based generative models,”arXiv preprint arXiv:2111.08005, 2021

  11. [11]

    Dual-domain collaborative diffusion sampling for multi-source stationary computed tomography reconstruction,

    Z. Li, D. Chang, Z. Zhang, F. Luo, Q. Liu, J. Zhang, G. Yang, and W. Wu, “Dual-domain collaborative diffusion sampling for multi-source stationary computed tomography reconstruction,”IEEE Transactions on Medical Imaging, vol. 43, no. 10, pp. 3398–3411, 2024

  12. [12]

    Physics-informed score-based diffusion model for limited-angle reconstruction of cardiac computed tomography,

    S. Han, Y . Xu, D. Wang, B. Morovati, L. Zhou, J. S. Maltz, G. Wang, and H. Yu, “Physics-informed score-based diffusion model for limited-angle reconstruction of cardiac computed tomography,”IEEE Transactions on Medical Imaging, vol. 44, no. 9, pp. 3629–3640, 2024

  13. [13]

    Two-and-a-half order score-based model for solving 3D ill-posed inverse problems,

    Z. Li, Y . Wang, J. Zhang, W. Wu, and H. Yu, “Two-and-a-half order score-based model for solving 3D ill-posed inverse problems,”Comput- ers in Biology and Medicine, vol. 168, p. 107819, 2024

  14. [14]

    Generative Modeling via Drifting

    M. Deng, H. Li, T. Li, Y . Du, and K. He, “Generative modeling via drifting,”arXiv preprint arXiv:2602.04770, 2026

  15. [15]

    Deep con- volutional neural network for inverse problems in imaging,

    K. H. Jin, M. T. McCann, E. Froustey, and M. Unser, “Deep con- volutional neural network for inverse problems in imaging,”IEEE Transactions on Image Processing, vol. 26, no. 9, pp. 4509–4522, 2017

  16. [16]

    Image-to-image translation with conditional adversarial networks,

    P. Isola, J.-Y . Zhu, T. Zhou, and A. A. Efros, “Image-to-image translation with conditional adversarial networks,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 1125–1134

  17. [17]

    Swin unetr: Swin transformers for semantic segmentation of brain tumors in MRI images,

    A. Hatamizadeh, V . Nath, Y . Tang, D. Yang, H. R. Roth, and D. Xu, “Swin unetr: Swin transformers for semantic segmentation of brain tumors in MRI images,” inInternational MICCAI Brainlesion Workshop. Springer, 2021, pp. 272–284

  18. [18]

    A review of deep learning CT reconstruction from incomplete projection data,

    T. Wang, W. Xia, J. Lu, and Y . Zhang, “A review of deep learning CT reconstruction from incomplete projection data,”IEEE Transactions on Radiation and Plasma Medical Sciences, vol. 8, no. 2, pp. 138–152, 2023

  19. [19]

    Ea- GANs: edge-aware generative adversarial networks for cross-modality MR image synthesis,

    B. Yu, L. Zhou, L. Wang, Y . Shi, J. Fripp, and P. Bourgeat, “Ea- GANs: edge-aware generative adversarial networks for cross-modality MR image synthesis,”IEEE Transactions on Medical Imaging, vol. 38, no. 7, pp. 1750–1762, 2019

  20. [20]

    Generative adversarial network in medical imaging: A review,

    X. Yi, E. Walia, and P. Babyn, “Generative adversarial network in medical imaging: A review,”Medical Image Analysis, vol. 58, p. 101552, 2019

  21. [21]

    Which training methods for GANs do actually converge?

    L. Mescheder, A. Geiger, and S. Nowozin, “Which training methods for GANs do actually converge?” inInternational Conference on Machine Learning. PMLR, 2018, pp. 3481–3490

  22. [22]

    Flow Matching for Generative Modeling

    Y . Lipman, R. T. Chen, H. Ben-Hamu, M. Nickel, and M. Le, “Flow matching for generative modeling,”arXiv preprint arXiv:2210.02747, 2022

  23. [23]

    Solving 3D inverse problems using pre-trained 2D diffusion models,

    H. Chung, D. Ryu, M. T. McCann, M. L. Klasky, and J. C. Ye, “Solving 3D inverse problems using pre-trained 2D diffusion models,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 22 542–22 551

  24. [24]

    Fast and robust fixed-point algorithms for independent component analysis,

    A. Hyvarinen, “Fast and robust fixed-point algorithms for independent component analysis,”IEEE Transactions on Neural Networks, vol. 10, no. 3, pp. 626–634, 1999

  25. [25]

    MedV AE: Efficient automated interpretation of medical images with large-scale generalizable autoencoders,

    M. Varma, A. Kumar, R. V . der Sluijs, S. Ostmeier, L. Blankemeier, P. J. M. Chambon, C. Bluethgen, J. Prince, C. Langlotz, and A. S. Chaudhari, “MedV AE: Efficient automated interpretation of medical images with large-scale generalizable autoencoders,” inMedical Imaging with Deep Learning, 2025

  26. [26]

    Conflict-averse gradient descent for multi-task learning,

    B. Liu, X. Liu, X. Jin, P. Stone, and Q. Liu, “Conflict-averse gradient descent for multi-task learning,”Advances in neural information pro- cessing systems, vol. 34, pp. 18 878–18 890, 2021

  27. [27]

    Multi-task learning as multi-objective opti- mization,

    O. Sener and V . Koltun, “Multi-task learning as multi-objective opti- mization,”Advances in Neural Information Processing Systems, vol. 31, 2018

  28. [28]

    SynthRAD2025 grand challenge dataset: Generating synthetic CTs for radiotherapy from head to abdomen,

    A. Thummerer, E. van der Bijl, A. J. Galapon, F. Kamp, M. Savenije, C. Muijs, S. Aluwini, R. J. Steenbakkers, S. Beuel, M. P. Intvenet al., “SynthRAD2025 grand challenge dataset: Generating synthetic CTs for radiotherapy from head to abdomen,”Medical Physics, vol. 52, no. 7, p. e17981, 2025

  29. [29]

    Weighted res-unet for high-quality retina vessel segmentation,

    X. Xiao, S. Lian, Z. Luo, and S. Li, “Weighted res-unet for high-quality retina vessel segmentation,” in2018 9th International Conference on Information Technology in Medicine and Education (ITME). IEEE, 2018, pp. 327–331

  30. [30]

    Denoising diffusion probabilistic models,

    J. Ho, A. Jain, and P. Abbeel, “Denoising diffusion probabilistic models,” Advances in Neural Information Processing Systems, vol. 33, pp. 6840– 6851, 2020

  31. [31]

    Denoising Diffusion Implicit Models

    J. Song, C. Meng, and S. Ermon, “Denoising diffusion implicit models,” arXiv preprint arXiv:2010.02502, 2020

  32. [32]

    Dpm-solver++: Fast solver for guided sampling of diffusion probabilistic models,

    C. Lu, Y . Zhou, F. Bao, J. Chen, C. Li, and J. Zhu, “Dpm-solver++: Fast solver for guided sampling of diffusion probabilistic models,”Machine Intelligence Research, vol. 22, no. 4, pp. 730–751, 2025

  33. [33]

    Unipc: A unified predictor- corrector framework for fast sampling of diffusion models,

    W. Zhao, L. Bai, Y . Rao, J. Zhou, and J. Lu, “Unipc: A unified predictor- corrector framework for fast sampling of diffusion models,”Advances in Neural Information Processing Systems, vol. 36, pp. 49 842–49 869, 2023

  34. [34]

    Totalsegmen- tator: robust segmentation of 104 anatomic structures in CT images,

    J. Wasserthal, H.-C. Breit, M. T. Meyer, M. Pradella, D. Hinck, A. W. Sauter, T. Heye, D. T. Boll, J. Cyriac, S. Yanget al., “Totalsegmen- tator: robust segmentation of 104 anatomic structures in CT images,” Radiology: Artificial Intelligence, vol. 5, no. 5, p. e230024, 2023

  35. [35]

    The noise power spectrum of CT images,

    M. F. Kijewski and P. F. Judy, “The noise power spectrum of CT images,” Physics in Medicine & Biology, vol. 32, no. 5, pp. 565–575, 1987

  36. [36]

    Low-dose CT image and projection dataset,

    T. R. Moen, B. Chen, D. R. Holmes III, X. Duan, Z. Yu, L. Yu, S. Leng, J. G. Fletcher, and C. H. McCollough, “Low-dose CT image and projection dataset,”Medical Physics, vol. 48, no. 2, pp. 902–911, 2021

  37. [37]

    Single-slice rebinning method for helical cone-beam CT,

    F. Noo, M. Defrise, and R. Clackdoyle, “Single-slice rebinning method for helical cone-beam CT,”Physics in Medicine & Biology, vol. 44, no. 2, pp. 561–570, 1999

  38. [38]

    Mednext: transformer-driven scaling of convnets for medical image segmentation,

    S. Roy, G. Koehler, C. Ulrich, M. Baumgartner, J. Petersen, F. Isensee, P. F. Jaeger, and K. H. Maier-Hein, “Mednext: transformer-driven scaling of convnets for medical image segmentation,” inInternational Confer- ence on Medical Image Computing and Computer-assisted Intervention. Springer, 2023, pp. 405–415