PSCT-Net: Geometry-Aware Pediatric Skull CT Reconstruction via Differentiable Back-Projection and Attention-Guided Refinement

Dong Yeong Kim; Jaewon Choi; Jinwook Choi; Joo Whan Kim; JunGyu Lee; Myeongseop Kim; Youmin Shin; Young-Gon Kim

arxiv: 2606.19867 · v3 · pith:HHNGZEEWnew · submitted 2026-06-18 · 💻 cs.CV · cs.AI

PSCT-Net: Geometry-Aware Pediatric Skull CT Reconstruction via Differentiable Back-Projection and Attention-Guided Refinement

Dong Yeong Kim , Jaewon Choi , Youmin Shin , JunGyu Lee , Myeongseop Kim , Jinwook Choi , Joo Whan Kim , Young-Gon Kim This is my paper

Pith reviewed 2026-07-02 21:58 UTC · model grok-4.3

classification 💻 cs.CV cs.AI

keywords pediatric skull CTsparse bi-planar X-raysdifferentiable back-projectionattention-guided projection3D CT reconstructionlow-dose imagingcraniofacial abnormalitiesvolumetric mamba

0 comments

The pith

Differentiable back-projection from bi-planar X-rays creates a spatially faithful 3D prior that reduces depth ambiguity in pediatric skull CT reconstruction.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops PSCT-Net to recover 3D CT volumes of children's skulls from only two X-ray projections as a low-dose alternative to full CT scans. Differentiable back-projection first builds an initial 3D volume that follows the actual X-ray geometry rather than ignoring it. An attention-guided module then refines the mapping from 2D image regions to specific 3D voxels, while a bidirectional Mamba module links distant parts of the volume efficiently. The authors also release a private dataset of normal and pathological pediatric skull scans for evaluation. If the approach holds, clinicians could obtain usable 3D bone detail without exposing developing anatomies to the radiation dose of a conventional CT exam.

Core claim

PSCT-Net claims that differentiable back-projection establishes a spatially faithful volumetric prior from sparse bi-planar X-rays, which directly alleviates depth ambiguity that arises when 2D features are lifted without geometry. The Attention-Guided Projection (AGP-3D) module learns non-linear voxel-wise correspondences between 2D regions and 3D locations, and the Bidirectional Mamba (BiM-3D) module captures long-range volumetric dependencies at linear complexity. Together these steps replace geometry-agnostic feature lifting and produce sharper osseous boundaries on the new PedSkull-CT cohort of pediatric cases.

What carries the argument

Differentiable back-projection that forms a volumetric prior by tracing X-ray paths from the two input views into 3D space using known geometry.

If this is right

Osseous boundaries stay sharper because the initial volume already respects projection geometry.
Depth ambiguity is reduced without requiring dense angular sampling of the X-ray source.
Long-range volumetric context is modeled at linear rather than quadratic cost.
The framework supplies a concrete route to lower-dose 3D craniofacial imaging in pediatrics.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same back-projection prior could be tested on adult head or torso data to check whether the geometry benefit transfers beyond pediatric skulls.
If the attention module learns stable correspondences, it might be swapped into other sparse-view modalities such as cone-beam CT.
Integration with existing metal-artifact reduction steps would be a direct next measurement on the same architecture.

Load-bearing premise

The private institutional PedSkull-CT dataset is representative of the target clinical population and the geometry model in the differentiable back-projection accurately captures real X-ray physics without calibration errors.

What would settle it

Reconstruction metrics degrade sharply when the same model is run on a second dataset acquired with different scanner geometry or on cases drawn from a demographically distinct population.

Figures

Figures reproduced from arXiv: 2606.19867 by Dong Yeong Kim, Jaewon Choi, Jinwook Choi, Joo Whan Kim, JunGyu Lee, Myeongseop Kim, Youmin Shin, Young-Gon Kim.

**Figure 1.** Figure 1: Overview of the proposed approach. Frontal and lateral X-rays are backprojected to form a coarse volumetric prior. This geometric prior is then refined by the generator using original X-rays to reconstruct a high-fidelity CT volume. assess complex skull deformities. Consequently, reconstructing high-fidelity 3D CT volumes from sparse 2D X-ray projections has emerged as a critical yet challenging objective… view at source ↗

**Figure 2.** Figure 2: Overview of PSCT-Net. The framework initializes a coarse volumetric prior via differentiable back-projection. This prior is refined by an encoder-decoder explicitly conditioned by the BP-C and MV3D-C modules to enforce geometric consistency. The network is trained using a compound objective of voxel-wise reconstruction (LG) and projection consistency (Lp). institutional pediatric skull CT cohort comprising… view at source ↗

**Figure 3.** Figure 3: Visualization of proposed modules. (a) BP-C: Back-projects and fuses 2D features for encoder conditioning. (b) MV3D-C: Aligns and averages multi-view 3D features in the decoder. (c) AGP-3D: Maps 2D features to 3D voxels via attentionguided projection. (d) BiM-3D: Refines bottleneck features via bidirectional state space modeling. 2.1 Back-Projection Volumetric Initialization Recovering 3D volumes from 2D … view at source ↗

**Figure 4.** Figure 4: Left: Real-world, DRR [21], and style-transferred X-rays. Right: Real-world reconstructions (red circles denote preserved patient-specific anatomy). 2.5 Loss Function Our training objective comprises three terms to ensure volumetric fidelity, geometric consistency, and texture realism: L = λadvLadv + λrecLrec + λprojLproj, (4) where the balancing weights (λadv, λrec, λproj) are empirically set to (0.1, 10… view at source ↗

**Figure 5.** Figure 5: Qualitative comparison of CT reconstructions [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

read the original abstract

Computed Tomography (CT) is essential for diagnosing pediatric craniofacial abnormalities, yet poses radiation risks to developing anatomies. Reconstructing 3D CT from sparse bi-planar X-rays offers a low-dose alternative but is severely ill-posed. Existing methods employ geometry-agnostic feature lifting, naively projecting 2D features into 3D without explicit spatial modeling, causing depth ambiguity and degraded osseous boundaries. We present PSCT-Net, a geometry-aware framework with differentiable back-projection. Differentiable back-projection establishes a spatially faithful volumetric prior, alleviating depth ambiguity. An Attention-Guided Projection (AGP-3D) module then learns non-linear voxel-wise correspondences between 2D regions and 3D locations. A Bidirectional Mamba (BiM-3D) module captures long-range volumetric dependencies with linear complexity. We further curate a private institutional pediatric skull CT cohort, PedSkull-CT, comprising normal and pathological cases for internal evaluation, addressing the gap in adult-centric, trunk-focused datasets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

PSCT-Net combines differentiable back-projection with attention and Mamba blocks for bi-planar pediatric skull reconstruction, but the geometry model's match to real physics is untested and the abstract supplies no numbers.

read the letter

The paper's main contribution is a geometry-aware network that starts with differentiable back-projection to build a volumetric prior from two X-ray views, then uses an AGP-3D attention module for voxel-wise 2D-to-3D mapping and a BiM-3D Mamba block for long-range dependencies. They also release a private pediatric skull CT dataset with normal and pathological cases. This specific assembly for craniofacial reconstruction from sparse views is not in the cited baselines.

The approach correctly identifies that naive feature lifting ignores acquisition geometry and can produce depth ambiguity. Explicit back-projection is a reasonable way to inject that information, and the choice of linear-complexity Mamba for 3D volumes makes sense for memory reasons.

The soft spot is that the central claim—differentiable back-projection produces a spatially faithful prior—depends on the forward model exactly matching the scanner's source-to-detector distances, angles, and beam characteristics. The abstract gives no calibration checks, phantom reprojection errors, or comparison against a physics simulator, so the alleviation of depth ambiguity remains an assumption rather than a demonstrated result. No quantitative tables or ablations appear in the visible text, which makes it impossible to judge whether the added modules deliver measurable gains over simpler lifting baselines. The private dataset also limits external scrutiny.

This work is aimed at medical imaging groups focused on low-dose pediatric CT. A reader working on reconstruction or radiation reduction would find the framing useful even if the experiments need strengthening. It is coherent on its own terms and engages the relevant literature, so it deserves a serious referee to evaluate the full implementation and results.

Referee Report

1 major / 1 minor

Summary. The manuscript presents PSCT-Net for 3D pediatric skull CT reconstruction from sparse bi-planar X-rays. It claims that differentiable back-projection creates a spatially faithful volumetric prior to alleviate depth ambiguity, an AGP-3D module learns non-linear voxel-wise 2D-3D correspondences, and a BiM-3D module captures long-range volumetric dependencies with linear complexity. A private PedSkull-CT dataset of normal and pathological cases is curated for internal evaluation.

Significance. If the geometric prior is faithful, the method could meaningfully reduce radiation dose in pediatric craniofacial imaging while improving osseous boundary accuracy over geometry-agnostic lifting approaches. The efficiency of the Mamba-based volumetric module is a potential strength for clinical deployment, though the private dataset restricts external reproducibility and generalizability assessments.

major comments (1)

[Abstract] Abstract (and method description): the claim that differentiable back-projection 'establishes a spatially faithful volumetric prior, alleviating depth ambiguity' is load-bearing for the central contribution, yet no calibration validation, phantom-based reprojection error, or comparison against a calibrated ray-tracing simulator is reported to confirm that source-to-detector geometry, beam model, and angles match physical acquisition without unmodeled distortions (heel effect, offsets). This directly affects whether the depth-ambiguity alleviation holds.

minor comments (1)

Notation for the differentiable back-projection operator and the precise formulation of the AGP-3D attention mechanism should be stated explicitly with equations to allow independent implementation.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive comment on the geometric validation of our differentiable back-projection module. We agree that explicit evidence of spatial fidelity is important to support the central claim and will strengthen the manuscript accordingly.

read point-by-point responses

Referee: [Abstract] Abstract (and method description): the claim that differentiable back-projection 'establishes a spatially faithful volumetric prior, alleviating depth ambiguity' is load-bearing for the central contribution, yet no calibration validation, phantom-based reprojection error, or comparison against a calibrated ray-tracing simulator is reported to confirm that source-to-detector geometry, beam model, and angles match physical acquisition without unmodeled distortions (heel effect, offsets). This directly affects whether the depth-ambiguity alleviation holds.

Authors: We acknowledge that the current manuscript does not report dedicated calibration validation, phantom reprojection errors, or comparisons to a ray-tracing simulator. In the revised version we will add: (1) a description of the geometric calibration procedure employed during bi-planar acquisition, (2) quantitative reprojection error statistics measured on a calibration phantom, and (3) a brief comparison of the differentiable back-projection output against a calibrated forward-projection model. These additions will directly substantiate the spatial faithfulness of the volumetric prior and the resulting reduction in depth ambiguity. revision: yes

Circularity Check

0 steps flagged

No circularity detected; architectural modules are independent design choices

full rationale

The abstract and available text describe PSCT-Net as a composition of differentiable back-projection for a volumetric prior, an AGP-3D attention module, and a BiM-3D Mamba module. No equations, parameter-fitting steps, or self-citations are shown that reduce any claimed prediction or prior to its own inputs by construction. The geometry model is presented as an explicit modeling assumption rather than a derived result, and no load-bearing uniqueness theorem or ansatz is imported from prior self-work. The derivation chain therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities; the geometry model inside differentiable back-projection is treated as given.

pith-pipeline@v0.9.1-grok · 5750 in / 1133 out tokens · 16004 ms · 2026-07-02T21:58:50.066448+00:00 · methodology

Review history (3 revisions) →

discussion (0)

Reference graph

Works this paper leans on

33 extracted references · 8 canonical work pages · 2 internal anchors

[1]

Radiology232(3), 739–748 (2004)

Armato III, S.G., McLennan, G., McNitt-Gray, M.F., Meyer, C.R., Yankelevitz, D., Aberle, D.R., Henschke, C.I., Hoffman, E.A., Kazerooni, E.A., MacMahon, H., et al.: Lung image database consortium: developing a resource for the medical imaging research community. Radiology232(3), 739–748 (2004)

2004
[2]

arXiv preprint arXiv:2406.04679 (2024)

Bai, Q., Liu, T., Liu, Z., Tong, Y., Torigian, D., Udupa, J.: Xctdiff: Reconstruction of ct images with consistent anatomical structures from a single radiographic projection image. arXiv preprint arXiv:2406.04679 (2024)

work page arXiv 2024
[3]

arXiv preprint arXiv:2408.10189 (2024)

Bick, A., Li, K.Y., Xing, E.P., Kolter, J.Z., Gu, A.: Transformers to ssms: Distilling quadratic knowledge to subquadratic models. arXiv preprint arXiv:2408.10189 (2024). https://doi.org/10.48550/arXiv.2408.10189

work page doi:10.48550/arxiv.2408.10189 2024
[4]

New England journal of medicine357(22), 2277–2284 (2007)

Brenner, D.J., Hall, E.J.: Computed tomography—an increasing source of radiation exposure. New England journal of medicine357(22), 2277–2284 (2007)

2007
[5]

Computers in biology and medicine154, 106615 (2023)

Chen, Z., Guo, L., Zhang, R., Fang, Z., He, X., Wang, J.: Bx2s-net: Learning to reconstruct 3d spinal structures from bi-planar x-ray images. Computers in biology and medicine154, 106615 (2023)

2023
[6]

Deng, Y., Wang, C., Hui, Y., Li, Q., Li, J., Luo, S., Sun, M., Quan, Q., Yang, S., Hao, Y., Liu, P., Xiao, H., Zhao, C., Wu, X., Zhou, S.K.: Ctspine1k: A large- scale dataset for spinal vertebrae segmentation in computed tomography (2024), https://arxiv.org/abs/2105.14711

work page arXiv 2024
[7]

Knowledge-Based Systems236, 107680 (2022)

Ge, R., He, Y., Xia, C., Xu, C., Sun, W., Yang, G., Li, J., Wang, Z., Yu, H., Zhang, D., et al.: X-ctrsnet: 3d cervical vertebra ct reconstruction and segmentation directly from 2d x-ray images. Knowledge-Based Systems236, 107680 (2022)

2022
[8]

Goske, M.J., Applegate, K.E., Boylan, J., Butler, P.F., Callahan, M.J., Coley, B.D., Farley, S., Frush, D.P., Hernanz-Schulman, M., Jaramillo, D., et al.: The image gently campaign: working together to change practice (2008)

2008
[9]

In: First conference on language modeling (2024)

Gu, A., Dao, T.: Mamba: Linear-time sequence modeling with selective state spaces. In: First conference on language modeling (2024)

2024
[10]

In: Computer Graphics Forum

Henzler, P., Rasche, V., Ropinski, T., Ritschel, T.: Single-image tomography: 3d volumes from 2d cranial x-rays. In: Computer Graphics Forum. vol. 37, pp. 377–388. Wiley Online Library (2018)

2018
[11]

In: Proceedings of the IEEE conference on computer vision and pattern recognition

Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A.: Image-to-image translation with condi- tional adversarial networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 1125–1134 (2017)

2017
[12]

SIAM (2001)

Kak, A.C., Slaney, M.: Principles of computerized tomographic imaging. SIAM (2001)

2001
[13]

ACM Trans

Kerbl, B., Kopanas, G., Leimkühler, T., Drettakis, G.: 3d gaussian splatting for real-time radiance field rendering. ACM Trans. Graph.42(4), 139–1 (2023) 10 D.Y. Kim et al

2023
[14]

Studies in Health Technology and Informatics329, 578–582 (2025)

Kim, D.Y., Kim, J.W., Kim, S.K., Kim, Y.G.: Multi-modal and multi-view fu- sion classifier for craniosynostosis diagnosis. Studies in Health Technology and Informatics329, 578–582 (2025)

2025
[15]

Liu, P., Han, H., Du, Y., Zhu, H., Li, Y., Gu, F., Xiao, H., Li, J., Zhao, C., Xiao, L., Wu, X., Zhou, S.K.: Deep learning to segment pelvic bones: Large-scale ct datasets and baseline models (2021), https://arxiv.org/abs/2012.08721

work page arXiv 2021
[16]

In: European conference on computer vision

Liu, X., Qiao, Z., Liu, R., Li, H., Zhang, J., Zhen, X., Qian, Z., Zhang, B.: Diffux2ct: Diffusion learning to reconstruct ct images from biplanar x-rays. In: European conference on computer vision. pp. 458–476. Springer (2024)

2024
[17]

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

Ma,J.,Li,F.,Wang,B.:U-mamba:Enhancinglong-rangedependencyforbiomedical image segmentation. arXiv preprint arXiv:2401.04722 (2024)

work page internal anchor Pith review Pith/arXiv arXiv 2024
[18]

In: Proceedings of the IEEE international conference on computer vision

Mao, X., Li, Q., Xie, H., Lau, R.Y., Wang, Z., Paul Smolley, S.: Least squares gen- erative adversarial networks. In: Proceedings of the IEEE international conference on computer vision. pp. 2794–2802 (2017)

2017
[19]

Radiology248(1), 254–263 (2008)

Mettler Jr, F.A., Huda, W., Yoshizumi, T.T., Mahesh, M.: Effective doses in radiology and diagnostic nuclear medicine: a catalog. Radiology248(1), 254–263 (2008)

2008
[20]

Commu- nications of the ACM65(1), 99–106 (2021)

Mildenhall, B., Srinivasan, P.P., Tancik, M., Barron, J.T., Ramamoorthi, R., Ng, R.: Nerf: Representing scenes as neural radiance fields for view synthesis. Commu- nications of the ACM65(1), 99–106 (2021)

2021
[21]

Physics in Medicine & Biology45(10), 2787 (2000)

Milickovic, N., Baltas, D., Giannouli, S., Lahanas, M., Zamboglou, N.: Ct imaging based digitally reconstructed radiographs and their application inbrachytherapy. Physics in Medicine & Biology45(10), 2787 (2000)

2000
[22]

In: Proceedings of the AAAI Conference on Artificial Intelligence

Peng, C., Liao, H., Wong, G., Luo, J., Zhou, S.K., Chellappa, R.: Xraysyn: Realistic view synthesis from a single radiograph through ct priors. In: Proceedings of the AAAI Conference on Artificial Intelligence. vol. 35, pp. 436–444 (2021)

2021
[23]

ACM Transactions on Multimedia Computing, Communications and Applications (2024)

Ruan, J., Li, J., Xiang, S.: Vm-unet: Vision mamba unet for medical image seg- mentation. ACM Transactions on Multimedia Computing, Communications and Applications (2024)

2024
[24]

Nature biomedical engineering3(11), 880–888 (2019)

Shen, L., Zhao, W., Xing, L.: Patient-specific reconstruction of volumetric com- puted tomography images from a single projection view via deep learning. Nature biomedical engineering3(11), 880–888 (2019)

2019
[25]

Denoising Diffusion Implicit Models

Song, J., Meng, C., Ermon, S.: Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502 (2020)

work page internal anchor Pith review Pith/arXiv arXiv 2010
[26]

Radiology173(3), 669–673 (1989)

Vannier, M.W., Hildebolt, C.F., Marsh, J.L., Pilgram, T.K., McAlister, W.H., Shackelford, G.D., Offutt, C.J., Knapp, R.H.: Craniosynostosis: diagnostic value of three-dimensional ct reconstruction. Radiology173(3), 669–673 (1989)

1989
[27]

Advances in neural information processing systems30(2017)

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. Advances in neural information processing systems30(2017)

2017
[28]

MiniLM: Deep self-attention distillation for task-agnostic compression of pre-trained transformers

Wang, W., Wei, F., Dong, L., Bao, H., Yang, N., Zhou, M.: Minilm: Deep self- attention distillation for task-agnostic compression of pre-trained transformers. arXiv preprint arXiv:2002.10957 (2020)

work page arXiv 2002
[29]

Digital Signal Processing140, 104123 (2023)

Wang, Y., Sun, Z.L., Zeng, Z., Lam, K.M.: Trct-gan: Ct reconstruction from biplane x-rays using transformer and generative adversarial networks. Digital Signal Processing140, 104123 (2023)

2023
[30]

arXiv preprint arXiv:2503.17804 (2025) PSCT-Net for Pediatric Skull CT Reconstruction 11

Xie, X., Liu, J., Fan, H., Han, Z., Tang, Y., Qu, L.: Dvg-diffusion: Dual-view guided diffusion model for ct reconstruction from x-rays. arXiv preprint arXiv:2503.17804 (2025) PSCT-Net for Pediatric Skull CT Reconstruction 11

work page arXiv 2025
[31]

In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition

Ying, X., Guo, H., Ma, K., Wu, J., Weng, Z., Zheng, Y.: X2ct-gan: reconstructing ct from biplanar x-rays with generative adversarial networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 10619–10628 (2019)

2019
[32]

In: Proceedings of the IEEE conference on computer vision and pattern recognition

Zhang, R., Isola, P., Efros, A.A., Shechtman, E., Wang, O.: The unreasonable effectiveness of deep features as a perceptual metric. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 586–595 (2018)

2018
[33]

In: Proceedings of the IEEE international conference on computer vision

Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE international conference on computer vision. pp. 2223–2232 (2017)

2017

[1] [1]

Radiology232(3), 739–748 (2004)

Armato III, S.G., McLennan, G., McNitt-Gray, M.F., Meyer, C.R., Yankelevitz, D., Aberle, D.R., Henschke, C.I., Hoffman, E.A., Kazerooni, E.A., MacMahon, H., et al.: Lung image database consortium: developing a resource for the medical imaging research community. Radiology232(3), 739–748 (2004)

2004

[2] [2]

arXiv preprint arXiv:2406.04679 (2024)

Bai, Q., Liu, T., Liu, Z., Tong, Y., Torigian, D., Udupa, J.: Xctdiff: Reconstruction of ct images with consistent anatomical structures from a single radiographic projection image. arXiv preprint arXiv:2406.04679 (2024)

work page arXiv 2024

[3] [3]

arXiv preprint arXiv:2408.10189 (2024)

Bick, A., Li, K.Y., Xing, E.P., Kolter, J.Z., Gu, A.: Transformers to ssms: Distilling quadratic knowledge to subquadratic models. arXiv preprint arXiv:2408.10189 (2024). https://doi.org/10.48550/arXiv.2408.10189

work page doi:10.48550/arxiv.2408.10189 2024

[4] [4]

New England journal of medicine357(22), 2277–2284 (2007)

Brenner, D.J., Hall, E.J.: Computed tomography—an increasing source of radiation exposure. New England journal of medicine357(22), 2277–2284 (2007)

2007

[5] [5]

Computers in biology and medicine154, 106615 (2023)

Chen, Z., Guo, L., Zhang, R., Fang, Z., He, X., Wang, J.: Bx2s-net: Learning to reconstruct 3d spinal structures from bi-planar x-ray images. Computers in biology and medicine154, 106615 (2023)

2023

[6] [6]

Deng, Y., Wang, C., Hui, Y., Li, Q., Li, J., Luo, S., Sun, M., Quan, Q., Yang, S., Hao, Y., Liu, P., Xiao, H., Zhao, C., Wu, X., Zhou, S.K.: Ctspine1k: A large- scale dataset for spinal vertebrae segmentation in computed tomography (2024), https://arxiv.org/abs/2105.14711

work page arXiv 2024

[7] [7]

Knowledge-Based Systems236, 107680 (2022)

Ge, R., He, Y., Xia, C., Xu, C., Sun, W., Yang, G., Li, J., Wang, Z., Yu, H., Zhang, D., et al.: X-ctrsnet: 3d cervical vertebra ct reconstruction and segmentation directly from 2d x-ray images. Knowledge-Based Systems236, 107680 (2022)

2022

[8] [8]

Goske, M.J., Applegate, K.E., Boylan, J., Butler, P.F., Callahan, M.J., Coley, B.D., Farley, S., Frush, D.P., Hernanz-Schulman, M., Jaramillo, D., et al.: The image gently campaign: working together to change practice (2008)

2008

[9] [9]

In: First conference on language modeling (2024)

Gu, A., Dao, T.: Mamba: Linear-time sequence modeling with selective state spaces. In: First conference on language modeling (2024)

2024

[10] [10]

In: Computer Graphics Forum

Henzler, P., Rasche, V., Ropinski, T., Ritschel, T.: Single-image tomography: 3d volumes from 2d cranial x-rays. In: Computer Graphics Forum. vol. 37, pp. 377–388. Wiley Online Library (2018)

2018

[11] [11]

In: Proceedings of the IEEE conference on computer vision and pattern recognition

Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A.: Image-to-image translation with condi- tional adversarial networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 1125–1134 (2017)

2017

[12] [12]

SIAM (2001)

Kak, A.C., Slaney, M.: Principles of computerized tomographic imaging. SIAM (2001)

2001

[13] [13]

ACM Trans

Kerbl, B., Kopanas, G., Leimkühler, T., Drettakis, G.: 3d gaussian splatting for real-time radiance field rendering. ACM Trans. Graph.42(4), 139–1 (2023) 10 D.Y. Kim et al

2023

[14] [14]

Studies in Health Technology and Informatics329, 578–582 (2025)

Kim, D.Y., Kim, J.W., Kim, S.K., Kim, Y.G.: Multi-modal and multi-view fu- sion classifier for craniosynostosis diagnosis. Studies in Health Technology and Informatics329, 578–582 (2025)

2025

[15] [15]

Liu, P., Han, H., Du, Y., Zhu, H., Li, Y., Gu, F., Xiao, H., Li, J., Zhao, C., Xiao, L., Wu, X., Zhou, S.K.: Deep learning to segment pelvic bones: Large-scale ct datasets and baseline models (2021), https://arxiv.org/abs/2012.08721

work page arXiv 2021

[16] [16]

In: European conference on computer vision

Liu, X., Qiao, Z., Liu, R., Li, H., Zhang, J., Zhen, X., Qian, Z., Zhang, B.: Diffux2ct: Diffusion learning to reconstruct ct images from biplanar x-rays. In: European conference on computer vision. pp. 458–476. Springer (2024)

2024

[17] [17]

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

Ma,J.,Li,F.,Wang,B.:U-mamba:Enhancinglong-rangedependencyforbiomedical image segmentation. arXiv preprint arXiv:2401.04722 (2024)

work page internal anchor Pith review Pith/arXiv arXiv 2024

[18] [18]

In: Proceedings of the IEEE international conference on computer vision

Mao, X., Li, Q., Xie, H., Lau, R.Y., Wang, Z., Paul Smolley, S.: Least squares gen- erative adversarial networks. In: Proceedings of the IEEE international conference on computer vision. pp. 2794–2802 (2017)

2017

[19] [19]

Radiology248(1), 254–263 (2008)

Mettler Jr, F.A., Huda, W., Yoshizumi, T.T., Mahesh, M.: Effective doses in radiology and diagnostic nuclear medicine: a catalog. Radiology248(1), 254–263 (2008)

2008

[20] [20]

Commu- nications of the ACM65(1), 99–106 (2021)

Mildenhall, B., Srinivasan, P.P., Tancik, M., Barron, J.T., Ramamoorthi, R., Ng, R.: Nerf: Representing scenes as neural radiance fields for view synthesis. Commu- nications of the ACM65(1), 99–106 (2021)

2021

[21] [21]

Physics in Medicine & Biology45(10), 2787 (2000)

Milickovic, N., Baltas, D., Giannouli, S., Lahanas, M., Zamboglou, N.: Ct imaging based digitally reconstructed radiographs and their application inbrachytherapy. Physics in Medicine & Biology45(10), 2787 (2000)

2000

[22] [22]

In: Proceedings of the AAAI Conference on Artificial Intelligence

Peng, C., Liao, H., Wong, G., Luo, J., Zhou, S.K., Chellappa, R.: Xraysyn: Realistic view synthesis from a single radiograph through ct priors. In: Proceedings of the AAAI Conference on Artificial Intelligence. vol. 35, pp. 436–444 (2021)

2021

[23] [23]

ACM Transactions on Multimedia Computing, Communications and Applications (2024)

Ruan, J., Li, J., Xiang, S.: Vm-unet: Vision mamba unet for medical image seg- mentation. ACM Transactions on Multimedia Computing, Communications and Applications (2024)

2024

[24] [24]

Nature biomedical engineering3(11), 880–888 (2019)

Shen, L., Zhao, W., Xing, L.: Patient-specific reconstruction of volumetric com- puted tomography images from a single projection view via deep learning. Nature biomedical engineering3(11), 880–888 (2019)

2019

[25] [25]

Denoising Diffusion Implicit Models

Song, J., Meng, C., Ermon, S.: Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502 (2020)

work page internal anchor Pith review Pith/arXiv arXiv 2010

[26] [26]

Radiology173(3), 669–673 (1989)

Vannier, M.W., Hildebolt, C.F., Marsh, J.L., Pilgram, T.K., McAlister, W.H., Shackelford, G.D., Offutt, C.J., Knapp, R.H.: Craniosynostosis: diagnostic value of three-dimensional ct reconstruction. Radiology173(3), 669–673 (1989)

1989

[27] [27]

Advances in neural information processing systems30(2017)

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. Advances in neural information processing systems30(2017)

2017

[28] [28]

MiniLM: Deep self-attention distillation for task-agnostic compression of pre-trained transformers

Wang, W., Wei, F., Dong, L., Bao, H., Yang, N., Zhou, M.: Minilm: Deep self- attention distillation for task-agnostic compression of pre-trained transformers. arXiv preprint arXiv:2002.10957 (2020)

work page arXiv 2002

[29] [29]

Digital Signal Processing140, 104123 (2023)

Wang, Y., Sun, Z.L., Zeng, Z., Lam, K.M.: Trct-gan: Ct reconstruction from biplane x-rays using transformer and generative adversarial networks. Digital Signal Processing140, 104123 (2023)

2023

[30] [30]

arXiv preprint arXiv:2503.17804 (2025) PSCT-Net for Pediatric Skull CT Reconstruction 11

Xie, X., Liu, J., Fan, H., Han, Z., Tang, Y., Qu, L.: Dvg-diffusion: Dual-view guided diffusion model for ct reconstruction from x-rays. arXiv preprint arXiv:2503.17804 (2025) PSCT-Net for Pediatric Skull CT Reconstruction 11

work page arXiv 2025

[31] [31]

In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition

Ying, X., Guo, H., Ma, K., Wu, J., Weng, Z., Zheng, Y.: X2ct-gan: reconstructing ct from biplanar x-rays with generative adversarial networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 10619–10628 (2019)

2019

[32] [32]

In: Proceedings of the IEEE conference on computer vision and pattern recognition

Zhang, R., Isola, P., Efros, A.A., Shechtman, E., Wang, O.: The unreasonable effectiveness of deep features as a perceptual metric. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 586–595 (2018)

2018

[33] [33]

In: Proceedings of the IEEE international conference on computer vision

Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE international conference on computer vision. pp. 2223–2232 (2017)

2017