Towards reconstructing experimental sparse-view X-ray CT data with diffusion models

Ezgi Demircan-Tureyen; Felix Lucka; Nelas J. Thomsen; Xinyuan Wang

arxiv: 2602.12755 · v3 · pith:5HTYBDGXnew · submitted 2026-02-13 · 💻 cs.CV

Towards reconstructing experimental sparse-view X-ray CT data with diffusion models

Nelas J. Thomsen , Xinyuan Wang , Felix Lucka , Ezgi Demircan-Tureyen This is my paper

Pith reviewed 2026-05-21 12:44 UTC · model grok-4.3

classification 💻 cs.CV

keywords diffusion modelssparse-view CTdomain shiftX-ray computed tomographyexperimental datainverse problemsCT reconstruction

0 comments

The pith

Diffusion priors trained on diverse synthetic data reconstruct experimental sparse-view CT scans effectively.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether diffusion models trained on synthetic data can serve as effective priors for reconstructing experimental sparse-view X-ray CT scans. It examines the effects of domain shift between synthetic training data and a physical phantom, as well as forward model mismatch in the reconstruction process. Results indicate that severe domain mismatch leads to model collapse, but diverse priors perform as well as or better than narrowly matched ones. Annealed likelihood weight schedules help mitigate artifacts from forward model mismatch while improving efficiency. This work highlights that benefits seen in synthetic settings do not directly carry over to real experimental data.

Core claim

Diffusion-based priors trained on synthetic image data sets with different degrees of domain shift can be used in a Decomposed Diffusion Sampling scheme to reconstruct sparse-view CT data from a physical phantom. Diverse priors match or exceed well-matched narrow priors, and annealed schedules mitigate forward model mismatch artifacts.

What carries the argument

Decomposed Diffusion Sampling scheme using diffusion priors on sparse-view CT data with annealed likelihood weight schedules to address domain and forward model mismatch.

Load-bearing premise

The physical phantom and the chosen synthetic training sets with varying degrees of domain shift are sufficient to represent the mismatch that would occur with real clinical or industrial CT data.

What would settle it

Applying the method to clinical patient CT data and observing persistent hallucinations or reconstruction failure despite diverse priors would show the assumption does not hold for practical cases.

Figures

Figures reproduced from arXiv: 2602.12755 by Ezgi Demircan-Tureyen, Felix Lucka, Nelas J. Thomsen, Xinyuan Wang.

**Figure 1.** Figure 1: PSNR (dB) as a function of the number of projections across four different test domains for the reconstructions obtained [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗

**Figure 3.** Figure 3: Line profiles from reconstructions shown in Fig. 2 [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 4.** Figure 4: PSNR (dB) vs number of projections for three reso [PITH_FULL_IMAGE:figures/full_fig_p004_4.png] view at source ↗

read the original abstract

Diffusion-based image generators are promising priors for ill-posed inverse problems like sparse-view X-ray Computed Tomography (CT). As most studies consider synthetic data, it is not clear whether training data mismatch (``domain shift'') or forward model mismatch complicate their successful application to experimental data. We measured CT data from a physical phantom resembling the synthetic Shepp-Logan phantom and trained diffusion priors on synthetic image data sets with different degrees of domain shift towards it. Then, we employed the priors in a Decomposed Diffusion Sampling scheme on sparse-view CT data sets with increasing difficulty leading to the experimental data. Our results reveal that domain shift plays a nuanced role: while severe mismatch causes model collapse and hallucinations, diverse priors match or exceed well-matched but narrow priors. Forward model mismatch pulls the image samples away from the prior manifold, which causes artifacts but can be mitigated with annealed likelihood weight schedules that also increase computational efficiency. Overall, we demonstrate that performance gains do not immediately translate from synthetic to experimental data, and future development must validate against real-world benchmarks.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper shows diffusion priors for sparse-view CT can be tested on real experimental data but the gains from synthetic settings do not carry over cleanly, with domain shift and forward-model mismatch both mattering in ways that annealed schedules can partly fix.

read the letter

The main thing to know is that this work moves beyond pure simulation by measuring actual sparse-view CT from a physical phantom and comparing diffusion priors trained on synthetic data with controlled levels of mismatch to that phantom. They use decomposed diffusion sampling and test on data sets of increasing difficulty up to the real measurements. The results indicate that severe domain shift causes collapse and hallucinations while more diverse priors can match or beat narrowly matched ones, and that forward-model mismatch creates artifacts that annealed likelihood weighting can reduce while also cutting compute time. Performance improvements seen in synthetic cases do not translate directly to the experimental setting, which is the central caution they draw. This is a straightforward empirical extension that fills a gap by using hardware data instead of staying in simulation. The controlled variation of domain shift and the use of a physical phantom give the study some grounding that pure synthetic papers lack. The soft spot is the narrow test case: the phantom closely resembles the classic Shepp-Logan, so the observed nuanced effects of mismatch may not capture the larger anatomical, polychromatic, and scatter variations found in clinical or industrial scans. The abstract also omits quantitative metrics or statistical details, which leaves the strength of the claims harder to judge from the summary alone. Readers working on learned priors for inverse problems in imaging will find the real-data check useful. It is solid enough on its own terms to deserve peer review, though it would benefit from broader test objects in revision.

Referee Report

2 major / 2 minor

Summary. The manuscript investigates the application of diffusion model priors to experimental sparse-view X-ray CT reconstruction. The authors acquire data from a physical phantom resembling the Shepp-Logan phantom, train diffusion priors on synthetic image datasets with controlled degrees of domain shift, and apply them via Decomposed Diffusion Sampling on sparse-view measurements of increasing difficulty up to the experimental case. They report that domain shift plays a nuanced role—severe mismatch leads to collapse and hallucinations while diverse priors match or exceed narrow but well-matched ones—and that annealed likelihood weight schedules mitigate artifacts from forward-model mismatch, though synthetic performance gains do not translate directly to experimental data.

Significance. If the empirical observations hold under broader validation, the work is significant for highlighting practical challenges in transferring generative priors from synthetic to real CT data. It supplies concrete evidence on the effects of domain shift and a mitigation strategy (annealed schedules) that also improves efficiency, offering guidance for future diffusion-based inverse-problem solvers in medical and industrial imaging.

major comments (2)

[Experimental Setup and Results] The central claim that domain shift has a nuanced role and that annealed likelihood schedules mitigate forward-model mismatch rests on measurements from a single physical phantom whose geometry and material properties closely mirror the synthetic Shepp-Logan used for training. Real clinical or industrial CT data exhibit far greater anatomical variability, polychromatic beam hardening, detector-specific noise, and scatter that are absent here. This limitation is load-bearing for the generalization of the reported mitigation strategy and the conclusion that performance gains do not translate.
[Results] The abstract and summary describe qualitative observations of model collapse, hallucinations, and artifact mitigation, yet the provided description contains no quantitative metrics, error bars, or statistical tests comparing diverse versus narrow priors or the effect of annealed schedules. Without these, the robustness of the cross-condition claims cannot be verified.

minor comments (2)

[Methods] Clarify the precise implementation of the annealed likelihood weight schedule and its integration into the Decomposed Diffusion Sampling algorithm, including any hyper-parameter choices.
[Figures] Add scale bars, quantitative error maps, and direct side-by-side comparisons in the reconstruction figures to support visual claims.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback, which helps clarify the scope and presentation of our work on diffusion priors for experimental sparse-view CT. We address each major comment below and indicate the revisions we will make.

read point-by-point responses

Referee: [Experimental Setup and Results] The central claim that domain shift has a nuanced role and that annealed likelihood schedules mitigate forward-model mismatch rests on measurements from a single physical phantom whose geometry and material properties closely mirror the synthetic Shepp-Logan used for training. Real clinical or industrial CT data exhibit far greater anatomical variability, polychromatic beam hardening, detector-specific noise, and scatter that are absent here. This limitation is load-bearing for the generalization of the reported mitigation strategy and the conclusion that performance gains do not translate.

Authors: We agree that reliance on a single physical phantom closely matching the synthetic Shepp-Logan constitutes a genuine limitation for broad generalization claims. Our experimental design deliberately used this controlled phantom to isolate domain shift and forward-model mismatch effects that are otherwise confounded in heterogeneous clinical data. In the revised manuscript we will expand the Discussion and Limitations sections to explicitly state this constraint, qualify that the annealed-schedule mitigation is demonstrated under these specific conditions, and add a forward-looking paragraph on the need for validation against datasets with beam hardening, scatter, and anatomical variability. revision: yes
Referee: [Results] The abstract and summary describe qualitative observations of model collapse, hallucinations, and artifact mitigation, yet the provided description contains no quantitative metrics, error bars, or statistical tests comparing diverse versus narrow priors or the effect of annealed schedules. Without these, the robustness of the cross-condition claims cannot be verified.

Authors: We concur that quantitative support would strengthen the reported observations. Although the present manuscript emphasizes visual evidence to illustrate phenomena such as collapse and artifact reduction, we will add quantitative metrics (e.g., PSNR, SSIM) computed on the reconstructed volumes, include error bars derived from multiple independent sampling runs, and report simple statistical comparisons between prior diversity levels and likelihood schedules in the revised Results section. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected in experimental pipeline

full rationale

The paper describes an empirical workflow: acquiring real CT measurements from a physical phantom, generating synthetic training sets with controlled domain shifts, and applying diffusion priors via an existing sampling scheme to sparse-view data. All reported outcomes (model collapse under severe mismatch, benefits of diverse priors, and mitigation via annealed schedules) are direct observations from these measurements and controlled variations. No mathematical derivation, fitted parameter, or self-citation is shown to define or force the central claims by construction; the results remain falsifiable against the physical data and are not equivalent to the inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central findings rest on the representativeness of the physical phantom and the adequacy of the synthetic training variations to capture real-world mismatch; no new physical entities or large numbers of free parameters are introduced.

axioms (1)

domain assumption The physical phantom data and the chosen synthetic image sets adequately capture the domain shift and forward-model mismatch present in practical CT applications.
This premise is invoked when the authors interpret their experimental results as general lessons for real-world deployment.

pith-pipeline@v0.9.0 · 5722 in / 1330 out tokens · 38688 ms · 2026-05-21T12:44:02.325674+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We employ DDS for reverse diffusion sampling... CG(A⊤A, A⊤y, ˆxt, M) ... annealed likelihood weight schedules
IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean absolute_floor_iff_bare_distinguishability unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

physical phantom resembling the synthetic Shepp-Logan phantom... three training sets with varying domain shift

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

17 extracted references · 17 canonical work pages · 2 internal anchors

[1]

Image reconstruction in circular cone-beam computed tomography by constrained, total-variation minimization,

E. Y . Sidky and X. Pan, “Image reconstruction in circular cone-beam computed tomography by constrained, total-variation minimization,” Physics in Medicine & Biology, vol. 53, no. 17, pp. 4777–4807, 2008

work page 2008
[2]

Prior image constrained compressed sensing (piccs): a method to accurately reconstruct dynamic ct im- ages from highly undersampled projection data sets,

G.-H. Chen, J. Tang, and S. Leng, “Prior image constrained compressed sensing (piccs): a method to accurately reconstruct dynamic ct im- ages from highly undersampled projection data sets,”Medical physics, vol. 35, no. 2, pp. 660–663, 2008

work page 2008
[3]

Deep con- volutional neural network for inverse problems in imaging,

K. H. Jin, M. T. McCann, E. Froustey, and M. Unser, “Deep con- volutional neural network for inverse problems in imaging,”IEEE transactions on image processing, vol. 26, no. 9, pp. 4509–4522, 2017

work page 2017
[4]

Learn: Learned experts’ assessment- based reconstruction network for sparse-data ct,

H. Chen, Y . Zhang, Y . Chen, J. Zhang, W. Zhang, H. Sun, Y . Lv, P. Liao, J. Zhou, and G. Wang, “Learn: Learned experts’ assessment- based reconstruction network for sparse-data ct,”IEEE transactions on medical imaging, vol. 37, no. 6, pp. 1333–1347, 2018

work page 2018
[5]

Learned primal-dual reconstruction,

J. Adler and O. ¨Oktem, “Learned primal-dual reconstruction,”IEEE transactions on medical imaging, vol. 37, no. 6, pp. 1322–1332, 2018

work page 2018
[6]

Solving inverse problems in medical imaging with score-based generative models,

Y . Song, L. Shen, L. Xing, and S. Ermon, “Solving inverse problems in medical imaging with score-based generative models,” inInternational Conference on Learning Representations, 2022

work page 2022
[7]

Improving diffusion models for inverse problems using manifold constraints,

H. Chung, B. Sim, D. Ryu, and J. C. Ye, “Improving diffusion models for inverse problems using manifold constraints,” in36th Conference on Neural Information Processing Systems, NeurIPS 2022. Neural information processing systems foundation, 2022

work page 2022
[8]

Diffusion posterior sampling for general noisy inverse problems,

H. Chung, J. Kim, M. T. Mccann, M. L. Klasky, and J. C. Ye, “Diffusion posterior sampling for general noisy inverse problems,” inThe Eleventh International Conference on Learning Representations, 2023

work page 2023
[9]

Improving diffusion inverse problem solving with decoupled noise an- nealing,

B. Zhang, W. Chu, J. Berner, C. Meng, A. Anandkumar, and Y . Song, “Improving diffusion inverse problem solving with decoupled noise an- nealing,” inProceedings of the Computer Vision and Pattern Recognition Conference, 2025, pp. 20 895–20 905

work page 2025
[10]

Decomposed diffusion sampler for accelerating large-scale inverse problems,

H. Chung, S. Lee, and J. C. Ye, “Decomposed diffusion sampler for accelerating large-scale inverse problems,” in12th International Conference on Learning Representations, ICLR 2024, 2024

work page 2024
[11]

Denoising diffusion probabilistic models,

J. Ho, A. Jain, and P. Abbeel, “Denoising diffusion probabilistic models,” Advances in neural information processing systems, vol. 33, pp. 6840– 6851, 2020

work page 2020
[12]

Denoising Diffusion Implicit Models

J. Song, C. Meng, and S. Ermon, “Denoising diffusion implicit models,” arXiv preprint arXiv:2010.02502, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010
[13]

Tweedie’s formula and selection bias,

B. Efron, “Tweedie’s formula and selection bias,”Journal of the Amer- ican Statistical Association, vol. 106, no. 496, pp. 1602–1614, 2011

work page 2011
[14]

Improved denoising diffusion probabilis- tic models,

A. Q. Nichol and P. Dhariwal, “Improved denoising diffusion probabilis- tic models,” inInternational conference on machine learning. PMLR, 2021, pp. 8162–8171

work page 2021
[15]

The perception-distortion tradeoff,

Y . Blau and T. Michaeli, “The perception-distortion tradeoff,” inPro- ceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 6228–6237

work page 2018
[16]

On hallucinations in tomographic image reconstruction,

S. Bhadra, V . A. Kelkar, F. J. Brooks, and M. A. Anastasio, “On hallucinations in tomographic image reconstruction,”IEEE transactions on medical imaging, vol. 40, no. 11, pp. 3249–3260, 2021

work page 2021
[17]

sFRC for assessing hallucinations in medical image restoration

P. Kc, R. Zeng, N. Soni, and A. Badano, “sfrc for assessing hallucina- tions in medical image restoration,”arXiv preprint arXiv:2603.04673, 2026

work page internal anchor Pith review Pith/arXiv arXiv 2026

[1] [1]

Image reconstruction in circular cone-beam computed tomography by constrained, total-variation minimization,

E. Y . Sidky and X. Pan, “Image reconstruction in circular cone-beam computed tomography by constrained, total-variation minimization,” Physics in Medicine & Biology, vol. 53, no. 17, pp. 4777–4807, 2008

work page 2008

[2] [2]

Prior image constrained compressed sensing (piccs): a method to accurately reconstruct dynamic ct im- ages from highly undersampled projection data sets,

G.-H. Chen, J. Tang, and S. Leng, “Prior image constrained compressed sensing (piccs): a method to accurately reconstruct dynamic ct im- ages from highly undersampled projection data sets,”Medical physics, vol. 35, no. 2, pp. 660–663, 2008

work page 2008

[3] [3]

Deep con- volutional neural network for inverse problems in imaging,

K. H. Jin, M. T. McCann, E. Froustey, and M. Unser, “Deep con- volutional neural network for inverse problems in imaging,”IEEE transactions on image processing, vol. 26, no. 9, pp. 4509–4522, 2017

work page 2017

[4] [4]

Learn: Learned experts’ assessment- based reconstruction network for sparse-data ct,

H. Chen, Y . Zhang, Y . Chen, J. Zhang, W. Zhang, H. Sun, Y . Lv, P. Liao, J. Zhou, and G. Wang, “Learn: Learned experts’ assessment- based reconstruction network for sparse-data ct,”IEEE transactions on medical imaging, vol. 37, no. 6, pp. 1333–1347, 2018

work page 2018

[5] [5]

Learned primal-dual reconstruction,

J. Adler and O. ¨Oktem, “Learned primal-dual reconstruction,”IEEE transactions on medical imaging, vol. 37, no. 6, pp. 1322–1332, 2018

work page 2018

[6] [6]

Solving inverse problems in medical imaging with score-based generative models,

Y . Song, L. Shen, L. Xing, and S. Ermon, “Solving inverse problems in medical imaging with score-based generative models,” inInternational Conference on Learning Representations, 2022

work page 2022

[7] [7]

Improving diffusion models for inverse problems using manifold constraints,

H. Chung, B. Sim, D. Ryu, and J. C. Ye, “Improving diffusion models for inverse problems using manifold constraints,” in36th Conference on Neural Information Processing Systems, NeurIPS 2022. Neural information processing systems foundation, 2022

work page 2022

[8] [8]

Diffusion posterior sampling for general noisy inverse problems,

H. Chung, J. Kim, M. T. Mccann, M. L. Klasky, and J. C. Ye, “Diffusion posterior sampling for general noisy inverse problems,” inThe Eleventh International Conference on Learning Representations, 2023

work page 2023

[9] [9]

Improving diffusion inverse problem solving with decoupled noise an- nealing,

B. Zhang, W. Chu, J. Berner, C. Meng, A. Anandkumar, and Y . Song, “Improving diffusion inverse problem solving with decoupled noise an- nealing,” inProceedings of the Computer Vision and Pattern Recognition Conference, 2025, pp. 20 895–20 905

work page 2025

[10] [10]

Decomposed diffusion sampler for accelerating large-scale inverse problems,

H. Chung, S. Lee, and J. C. Ye, “Decomposed diffusion sampler for accelerating large-scale inverse problems,” in12th International Conference on Learning Representations, ICLR 2024, 2024

work page 2024

[11] [11]

Denoising diffusion probabilistic models,

J. Ho, A. Jain, and P. Abbeel, “Denoising diffusion probabilistic models,” Advances in neural information processing systems, vol. 33, pp. 6840– 6851, 2020

work page 2020

[12] [12]

Denoising Diffusion Implicit Models

J. Song, C. Meng, and S. Ermon, “Denoising diffusion implicit models,” arXiv preprint arXiv:2010.02502, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010

[13] [13]

Tweedie’s formula and selection bias,

B. Efron, “Tweedie’s formula and selection bias,”Journal of the Amer- ican Statistical Association, vol. 106, no. 496, pp. 1602–1614, 2011

work page 2011

[14] [14]

Improved denoising diffusion probabilis- tic models,

A. Q. Nichol and P. Dhariwal, “Improved denoising diffusion probabilis- tic models,” inInternational conference on machine learning. PMLR, 2021, pp. 8162–8171

work page 2021

[15] [15]

The perception-distortion tradeoff,

Y . Blau and T. Michaeli, “The perception-distortion tradeoff,” inPro- ceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 6228–6237

work page 2018

[16] [16]

On hallucinations in tomographic image reconstruction,

S. Bhadra, V . A. Kelkar, F. J. Brooks, and M. A. Anastasio, “On hallucinations in tomographic image reconstruction,”IEEE transactions on medical imaging, vol. 40, no. 11, pp. 3249–3260, 2021

work page 2021

[17] [17]

sFRC for assessing hallucinations in medical image restoration

P. Kc, R. Zeng, N. Soni, and A. Badano, “sfrc for assessing hallucina- tions in medical image restoration,”arXiv preprint arXiv:2603.04673, 2026

work page internal anchor Pith review Pith/arXiv arXiv 2026