Bilevel Optimized Implicit Neural Representation for Scan-Specific Accelerated MRI Reconstruction
Pith reviewed 2026-05-23 01:54 UTC · model grok-4.3
The pith
Bilevel optimization tunes implicit neural representation hyperparameters for scan-specific MRI reconstruction without training data.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that bilevel optimization of an INR—with a trainable positional encoder for feature embedding and a small multilayer perceptron decoder—via Gaussian process regression on the reconstruction objective automatically selects hyperparameters tailored to a given undersampled acquisition, yielding improved image quality over prior model-based and self-supervised techniques while completing the scan-specific reconstruction in seconds after a few minutes of optimization per 2D Cartesian scan.
What carries the argument
Bilevel optimization with Gaussian process regression over the hyperparameters of a trainable positional encoder plus multilayer perceptron implicit neural representation.
If this is right
- Hyperparameters are chosen automatically for each acquisition protocol without external training data.
- Optimization finishes in a few minutes per typical 2D Cartesian scan.
- Final reconstruction runs in seconds on scanner hardware.
- Image quality exceeds that of previous model-based and self-supervised learning methods.
- The framework accommodates different acquisitions through the same automated process.
Where Pith is reading between the lines
- The same bilevel structure might allow extension to non-Cartesian or 3D trajectories if the INR positional encoder can be adapted without retraining from scratch.
- Clinical deployment would benefit from verifying that the per-scan optimization remains stable across repeated scans of the same patient anatomy.
- If the method generalizes, it could reduce the need for separate reconstruction pipelines for each MRI vendor or field strength.
Load-bearing premise
Gaussian process regression applied to the bilevel objective will locate hyperparameters that recover the true underlying image instead of overfitting to the specific undersampled measurements or noise pattern of one scan.
What would settle it
Reconstructing a fully sampled reference scan with the optimized hyperparameters and finding higher error metrics or visible artifacts compared with standard non-INR methods would falsify the improvement claim.
Figures
read the original abstract
Deep Learning (DL) methods can reconstruct highly accelerated magnetic resonance imaging (MRI) scans, but they rely on application-specific large training datasets and often generalize poorly to out-of-distribution data. Self-supervised deep learning algorithms perform scan-specific reconstructions, but still require complicated hyperparameter tuning based on the acquisition and often offer limited acceleration. This work develops a bilevel-optimized implicit neural representation (INR) approach for scan-specific MRI reconstruction. The method automatically optimizes the hyperparameters for a given acquisition protocol, enabling a tailored reconstruction without training data. The proposed algorithm uses Gaussian process regression to optimize INR hyperparameters, accommodating various acquisitions. The INR includes a trainable positional encoder for high-dimensional feature embedding and a small multilayer perceptron for decoding. The bilevel optimization is computationally efficient, requiring only a few minutes per typical 2D Cartesian scan. On scanner hardware, the subsequent scan-specific reconstruction-using offline-optimized hyperparameters-is completed within seconds and achieves improved image quality compared to previous model-based and self-supervised learning methods.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a bilevel optimization framework for scan-specific accelerated MRI reconstruction using implicit neural representations (INRs). An outer loop employs Gaussian process regression to automatically tune hyperparameters of a trainable positional encoder and small MLP decoder; the inner loop performs the INR fit subject to a data-consistency term on the undersampled k-space measurements. The method is claimed to require only minutes of offline optimization per 2D Cartesian scan, after which reconstruction completes in seconds on scanner hardware and yields higher image quality than prior model-based and self-supervised baselines.
Significance. If the central claims hold, the work would be significant for scan-specific MRI by removing manual hyperparameter search while retaining the flexibility of INRs. The combination of bilevel optimization with GP regression for hyperparameter selection is a technically interesting direction that could generalize to other inverse problems. The reported computational profile (minutes offline, seconds online) is a practical strength if substantiated by timing tables.
major comments (2)
- [§3] §3 (Bilevel formulation): The outer objective is defined solely on the data-consistency residual of the given undersampled measurements. No independent validation split, noise-robust regularizer, or multi-realization test is described that would prevent the GP from selecting hyperparameters that overfit the particular noise realization or residual aliasing; this directly undermines the claim that the optimized INR recovers improved image quality rather than fitting acquisition artifacts.
- [§4.2] §4.2 (Quantitative results): The reported PSNR/SSIM gains versus baselines must be accompanied by per-scan standard deviations and statistical tests across at least 10–20 independent acquisitions; without this, the single-scan quality improvement cannot be distinguished from favorable noise realizations.
minor comments (2)
- [§2] The notation distinguishing the positional-encoder parameters from the MLP weights is introduced without an explicit equation reference; adding a compact table of symbols would improve readability.
- [Figure 3] Figure 3 caption should state the exact acceleration factor and sampling mask used for the displayed slices.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We respond to each major comment below, indicating planned revisions where appropriate.
read point-by-point responses
-
Referee: [§3] §3 (Bilevel formulation): The outer objective is defined solely on the data-consistency residual of the given undersampled measurements. No independent validation split, noise-robust regularizer, or multi-realization test is described that would prevent the GP from selecting hyperparameters that overfit the particular noise realization or residual aliasing; this directly undermines the claim that the optimized INR recovers improved image quality rather than fitting acquisition artifacts.
Authors: We acknowledge the validity of this observation. Because the approach is strictly scan-specific, the only measurements available for the outer objective are the given undersampled k-space data; no separate validation split exists by design. The limited capacity of the MLP decoder and the smoothing effect of Gaussian process regression provide implicit safeguards, yet these do not constitute an explicit guard against overfitting to a particular noise realization. We will revise §3 to explicitly discuss this limitation and to outline possible future extensions, such as the addition of a noise-robust regularizer to the outer objective. revision: partial
-
Referee: [§4.2] §4.2 (Quantitative results): The reported PSNR/SSIM gains versus baselines must be accompanied by per-scan standard deviations and statistical tests across at least 10–20 independent acquisitions; without this, the single-scan quality improvement cannot be distinguished from favorable noise realizations.
Authors: We agree that statistical rigor across multiple acquisitions is required to support the reported gains. The current manuscript presents results on representative scans without aggregated statistics. We will expand the evaluation in §4.2 to include at least 15 independent acquisitions, reporting mean ± standard deviation for PSNR and SSIM together with paired statistical tests (e.g., Wilcoxon signed-rank) against the baselines. revision: yes
Circularity Check
No circularity: bilevel optimization is a standard fitting procedure with external evaluation
full rationale
The paper presents a bilevel optimization framework in which Gaussian process regression tunes INR hyperparameters (positional encoder + MLP) to minimize a data-consistency objective on the given undersampled k-space measurements. This is a conventional hyperparameter search whose output is then used for reconstruction and compared to external baselines via image quality metrics. No equations or steps reduce by construction to their inputs, no self-citations are load-bearing for the central claim, and no uniqueness theorems or ansatzes are imported from prior author work. The method is self-contained as an optimization algorithm whose performance claims rest on empirical comparison rather than tautological re-derivation.
Axiom & Free-Parameter Ledger
Forward citations
Cited by 2 Pith papers
-
Discovery of unobservable parameters via physical embedding
PEIL learns unobservable parameters by embedding them in a physics-based reconstruction loop, outperforming supervised baselines with ground-truth access while enabling zero-shot generalization and major data reductio...
-
Towards a Unified Theoretical Framework for Splitting-based Self-Supervised MRI Reconstruction
UNITS framework proves self-supervised splitting risk in MRI reconstruction is a weighted supervised risk, yielding identical Bayes-optimal predictors and relating training residuals to prediction bias.
Reference graph
Works this paper leans on
-
[1]
SENSE: Sensitivity encoding for fast MRI,
K. P. Pruessmann, M. Weiger, M. B. Scheidegger, and P. Boesiger, “SENSE: Sensitivity encoding for fast MRI,” Magn. Reson. Med. , vol. 42, no. 5, pp. 952–962, Nov. 1999
work page 1999
-
[2]
Generalized autocalibrating partially parallel acquisitions (GRAPPA),
M. A. Griswold et al. , “Generalized autocalibrating partially parallel acquisitions (GRAPPA),” Magn. Reson. Med. , vol. 47, no. 6, pp. 1202– 1210, Jun. 2002
work page 2002
-
[3]
M. Lustig, D. L. Donoho, J. M. Santos, and J. M. Pauly, “Compressed sensing MRI,” IEEE Signal Process. Mag. , vol. 25, no. 2, pp. 72–82, 2008
work page 2008
-
[4]
K. T. Block, M. Uecker, and J. Frahm, “Undersampled radial MRI with multiple coils. Iterative image reconstruction using a total variation constraint,” Magn. Reson. Med. , vol. 57, no. 6, pp. 1086–1098, Jun. 2007
work page 2007
-
[5]
Low-rank modeling of local k-space neighborhoods (LORAKS) for constrained MRI,
J. P. Haldar, “Low-rank modeling of local k-space neighborhoods (LORAKS) for constrained MRI,” IEEE Trans. Med. Imag. , vol. 33, no. 3, pp. 668–681, Mar. 2014
work page 2014
-
[6]
P-LORAKS: Low-rank modeling of local k- space neighborhoods with parallel imaging data,
J. P. Haldar and J. Zhuo, “P-LORAKS: Low-rank modeling of local k- space neighborhoods with parallel imaging data,” Magn. Reson. Med. , vol. 75, no. 4, pp. 1499–1514, Apr. 2016
work page 2016
-
[7]
K. H. Jin, D. Lee, and J. C. Ye, “A general framework for compressed sensing and parallel MRI using annihilating filter based low-rank Hankel matrix,” IEEE Trans. Comput. Imag. , vol. 2, no. 4, pp. 480–495, 2016
work page 2016
-
[8]
k-t FOCUSS: a general compressed sensing framework for high resolution dynamic MRI,
H. Jung, K. Sung, K. S. Nayak, E. Y . Kim, and J. C. Ye, “k-t FOCUSS: a general compressed sensing framework for high resolution dynamic MRI,” Magn. Reson. Med. , vol. 61, no. 1, pp. 103–116, Jan. 2009
work page 2009
-
[9]
L. Feng et al., “Golden-angle radial sparse parallel MRI: combination of compressed sensing, parallel imaging, and golden-angle radial sampling for fast and flexible dynamic volumetric MRI,” Magn. Reson. Med. , vol. 72, no. 3, pp. 707–717, Sep. 2014
work page 2014
-
[10]
L. Feng, L. Axel, H. Chandarana, K. T. Block, D. K. Sodickson, and R. Otazo, “XD-GRASP: Golden-angle radial MRI with reconstruction of extra motion-state dimensions using compressed sensing,” Magn. Reson. Med., vol. 75, no. 2, pp. 775–788, Feb. 2016
work page 2016
-
[11]
Sparsity and locally low rank regularization for MR fingerprinting,
G. L. da Cruz, A. Bustin, O. Jaubert, T. Schneider, R. M. Botnar, and C. Prieto, “Sparsity and locally low rank regularization for MR fingerprinting,” Magn. Reson. Med. , vol. 81, no. 6, pp. 3530–3543, Jun. 2019
work page 2019
-
[12]
Compressed sensing MRI: A review from signal processing perspective,
J. C. Ye, “Compressed sensing MRI: A review from signal processing perspective,” BMC Biomedical Engineering , vol. 1, p. 8, Mar. 2019
work page 2019
-
[13]
fastMRI: An Open Dataset and Benchmarks for Accelerated MRI
J. Zbontar et al. , “fastMRI: An open dataset and benchmarks for accelerated MRI,” 2018, arXiv preprint arXiv:1811.08839. [Online]. Available: https://arxiv.org/abs/1811.08839
work page internal anchor Pith review arXiv 2018
-
[14]
A parallel MR imaging method using multilayer perceptron,
K. Kwon, D. Kim, and H. Park, “A parallel MR imaging method using multilayer perceptron,” Medical Physics, vol. 44, no. 12, pp. 6209–6224, 2017
work page 2017
-
[15]
Deep learning for undersampled MRI reconstruction,
C. M. Hyun, H. P. Kim, S. M. Lee, S. Lee, and J. K. Seo, “Deep learning for undersampled MRI reconstruction,” Physics in Medicine & Biology , vol. 63, no. 13, p. 135007, Jun. 2018
work page 2018
-
[16]
MoDL: Model-based deep learning architecture for inverse problems,
H. K. Aggarwal, M. P. Mani, and M. Jacob, “MoDL: Model-based deep learning architecture for inverse problems,” IEEE Trans. Med. Imag. , vol. 38, no. 2, pp. 394–405, 2019
work page 2019
-
[17]
ISTA-Net: Interpretable optimization-inspired deep network for image compressive sensing,
J. Zhang and B. Ghanem, “ISTA-Net: Interpretable optimization-inspired deep network for image compressive sensing,” in Proc. CVPR, 2018, pp. 1828–1837
work page 2018
-
[18]
F. Knoll et al., “Deep-learning methods for parallel magnetic resonance imaging reconstruction: A survey of the current approaches, trends, and issues,” IEEE Signal Proc. Mag. , vol. 37, no. 1, pp. 128–140, 2020
work page 2020
-
[19]
G. M ˚artensson et al. , “The reliability of a deep learning model in clinical out-of-distribution MRI data: A multicohort study,” Medical Image Analysis , vol. 66, p. 101714, 2020
work page 2020
-
[20]
A. D. Desai et al., “Noise2Recon: Enabling SNR-robust MRI reconstruc- tion with semi-supervised and self-supervised learning,” Magn. Reson. Med., vol. 90, no. 5, pp. 2052–2070, 2023
work page 2052
-
[21]
B. Yaman, S. Hosseini, S. Moeller, J. Ellermann, K. U ˘gurbil, and M. Akc ¸akaya, “Self-supervised learning of physics-guided reconstruc- tion neural networks without fully sampled reference data,” Magn. Reson. Med. , vol. 84, no. 6, pp. 3172–3191, 2020
work page 2020
-
[22]
D. Ulyanov, A. Vedaldi, and V . Lempitsky, “Deep image prior,” 2017, arXiv preprint arXiv:1711.10925. [Online]. Available: https: //arxiv.org/abs/1711.10925
-
[23]
A. P. Leynes, N. Deveshwar, S. S. Nagarajan, and P. E. Z. Larson, “Scan-specific self-supervised Bayesian deep non-linear inversion for undersampled MRI reconstruction,” IEEE Trans. Med. Imag. , vol. 43, no. 6, pp. 2358–2369, 2024
work page 2024
-
[24]
NeRF: Representing scenes as neural radiance fields for view synthesis,
B. Mildenhall, P. P. Srinivasan, M. Tancik, J. T. Barron, R. Ramamoorthi, and R. Ng, “NeRF: Representing scenes as neural radiance fields for view synthesis,” in Proc. ECCV, 2020
work page 2020
-
[25]
L. Shen, J. Pauly, and L. Xing, “NeRP: Implicit neural representation learning with prior embedding for sparsely sampled image reconstruc- tion,” IEEE Trans. Neural Netw. and Learn. Syst. , vol. 35, no. 1, pp. 770–782, 2024
work page 2024
-
[26]
R. Feng et al., “IMJENSE: Scan-specific implicit representation for joint coil sensitivity and image estimation in parallel MRI,” IEEE Trans. on Med. Imag. , vol. 43, no. 4, pp. 1539–1553, 2024
work page 2024
-
[27]
B. Liu, H. She, and Y . P. Du, “Scan-specific unsupervised highly accel- erated non-cartesian CEST imaging using implicit neural representation and explicit sparse prior,” IEEE Trans. Biomed. Eng. , vol. 71, no. 10, pp. 3032–3045, 2024
work page 2024
-
[28]
Spatiotemporal implicit neural representation for unsu- pervised dynamic MRI reconstruction,
J. Feng et al. , “Spatiotemporal implicit neural representation for unsu- pervised dynamic MRI reconstruction,” IEEE Trans. Med. Imag. , pp. 1–1, 2025
work page 2025
-
[29]
Bilevel methods for image reconstruc- tion,
C. Crockett and J. A. Fessler, “Bilevel methods for image reconstruc- tion,” F oundations and Trends® in Signal Processing , vol. 15, no. 2–3, p. 121–289, 2022
work page 2022
-
[30]
A Tutorial on Bayesian Optimization
P. I. Frazier, “A tutorial on Bayesian optimization,” 2018, arXiv preprint arXiv:1807.02811. [Online]. Available: https://arxiv.org/abs/1807.02811
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[31]
J. A. Fessler, “Optimization methods for magnetic resonance image reconstruction: Key models and optimization algorithms,” IEEE Signal Proc. Mag., vol. 37, no. 1, pp. 33–40, 2020
work page 2020
-
[32]
Low rank matrix recovery for real-time cardiac MRI,
B. Zhao, J. P. Haldar, C. Brinegar, and Z.-P. Liang, “Low rank matrix recovery for real-time cardiac MRI,” in Proc. IEEE Int. Symp. on Biomed. Imag. (ISBI): From Nano to Macro , 2010, pp. 996–999
work page 2010
-
[33]
Instant neural graphics primitives with a multiresolution hash encoding,
T. M ¨uller, A. Evans, C. Schied, and A. Keller, “Instant neural graphics primitives with a multiresolution hash encoding,” ACM Trans. Graph. , vol. 41, no. 4, pp. 102:1–102:15, Jul. 2022
work page 2022
-
[34]
NeRF in the dark: High dynamic range view synthesis from noisy raw images,
B. Mildenhall, P. Hedman, R. Martin-Brualla, P. P. Srinivasan, and J. T. Barron, “NeRF in the dark: High dynamic range view synthesis from noisy raw images,” in Proc. CVPR, 2022
work page 2022
-
[35]
ESPIRiT–an eigenvalue approach to autocalibrating parallel MRI: Where SENSE meets GRAPPA,
M. Uecker et al. , “ESPIRiT–an eigenvalue approach to autocalibrating parallel MRI: Where SENSE meets GRAPPA,” Magn. Reson. Med. , vol. 71, no. 3, pp. 990–1001, Mar. 2014
work page 2014
-
[36]
A software channel compression technique for faster reconstruction with many channels,
F. Huang, S. Vijayakumar, Y . Li, S. Hertel, and G. R. Duensing, “A software channel compression technique for faster reconstruction with many channels,” Magn. Reson. Imag., vol. 26, no. 1, pp. 133–141, 2008
work page 2008
-
[37]
Image reconstruction: An overview for clinicians,
M. S. Hansen and P. Kellman, “Image reconstruction: An overview for clinicians,” J. Magn. Reson. Imaging , vol. 41, no. 3, pp. 573–585, Mar. 2015
work page 2015
-
[38]
Nonuniform fast Fourier transforms using min-max interpolation,
J. A. Fessler and B. P. Sutton, “Nonuniform fast Fourier transforms using min-max interpolation,” IEEE Trans. Signal Process. , vol. 51, no. 2, pp. 560–574, 2003
work page 2003
-
[39]
TorchKbNufft: A high-level, hardware-agnostic non-uniform fast Fourier transform,
M. J. Muckley, R. Stern, T. Murrell, and F. Knoll, “TorchKbNufft: A high-level, hardware-agnostic non-uniform fast Fourier transform,” in ISMRM Workshop on Data Sampling & Image Reconstruction , 2020
work page 2020
-
[40]
CG-SENSE revisited: Results from the first ISMRM reproducibility challenge,
O. Maier et al. , “CG-SENSE revisited: Results from the first ISMRM reproducibility challenge,” Magn. Reson. Med. , vol. 85, no. 4, pp. 1821– 1839, Apr. 2021
work page 2021
-
[41]
Time-optimal multidimensional gradient waveform design for rapid imaging,
B. A. Hargreaves, D. G. Nishimura, and S. M. Conolly, “Time-optimal multidimensional gradient waveform design for rapid imaging,” Magn. Reson. Med. , vol. 51, no. 1, pp. 81–92, 2004
work page 2004
-
[42]
Acorn: adaptive coordinate networks for neural scene representation,
J. N. P. Martel, D. B. Lindell, C. Z. Lin, E. R. Chan, M. Monteiro, and G. Wetzstein, “Acorn: adaptive coordinate networks for neural scene representation,” ACM Trans. Graph. , vol. 40, no. 4, Jul. 2021
work page 2021
-
[43]
Sampling density compensation in MRI: Rationale and an iterative numerical solution,
J. G. Pipe and P. Menon, “Sampling density compensation in MRI: Rationale and an iterative numerical solution,” Magn. Reson. Med. , vol. 41, no. 1, pp. 179–186, Jan. 1999
work page 1999
-
[44]
Implicit neural representations with periodic activation functions,
V . Sitzmann, J. N. Martel, A. W. Bergman, D. B. Lindell, and G. Wetzstein, “Implicit neural representations with periodic activation functions,” in Proc. NIPS, 2020
work page 2020
-
[45]
Dying ReLU and initialization: Theory and numerical examples,
L. Lu, Y . Shin, Y . Su, and G. E. Karniadakis, “Dying ReLU and initialization: Theory and numerical examples,” Communications in Computational Physics , vol. 28, no. 5, p. 1671–1706, Jan. 2020. [Online]. Available: http://dx.doi.org/10.4208/cicp.OA-2020-0165
-
[46]
C. M. Bishop, Pattern Recognition and Machine Learning . Springer, 2006
work page 2006
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.