Adjoint-Based Bayesian Uncertainty Quantification for PDE-Constrained Inverse Problems with Application to Semiconductor Imaging
Pith reviewed 2026-06-30 04:59 UTC · model grok-4.3
The pith
A pushforward prior from a latent Matérn Gaussian field through a sigmoid, paired with adjoint NUTS sampling, reconstructs piecewise-constant doping interfaces in semiconductors while quantifying uncertainty.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that mapping a latent Gaussian field with Matérn covariance through a sigmoid produces a usable pushforward prior for piecewise-constant doping fields, and that this prior combined with adjoint-computed gradients enables the No-U-Turn Sampler to explore the posterior of the nonlinear PDE-constrained inverse problem more efficiently than dimension-robust alternatives while satisfying the well-posedness conditions of forward-map Lipschitz continuity and Hellinger posterior stability.
What carries the argument
The pushforward prior that applies a sigmoid transformation to a truncated Karhunen-Loève expansion of a Matérn Gaussian random field to generate differentiable approximations of piecewise-constant doping profiles with controllable interface sharpness.
If this is right
- In the known-plateau setting the method recovers both planar and curved interfaces together with spatially resolved uncertainty maps.
- When interface geometry and plateau concentrations are inferred jointly, posterior correlations reveal structural non-identifiability.
- The combination of the proposed prior and NUTS produces effective sample sizes one to two orders of magnitude larger than those obtained with the preconditioned Crank-Nicolson sampler.
- The Bayesian formulation satisfies well-posedness through Lipschitz continuity of the forward map and Hellinger stability of the posterior.
Where Pith is reading between the lines
- The observed non-identifiability when plateaus are unknown indicates that supplementary measurements or explicit constraints on concentration values would be needed for unique recovery.
- Because the adjoint supplies gradients at essentially the cost of one forward solve, the sampling approach remains computationally feasible when the number of KL modes or the spatial resolution is increased.
- The same sigmoid-pushforward construction could be transferred to other PDE inverse problems that require reconstruction of sharp material interfaces while preserving differentiability for gradient-based inference.
Load-bearing premise
The unknown doping field is assumed to be exactly piecewise constant with a single interface separating two constant plateau concentrations.
What would settle it
A numerical experiment on synthetic boundary flux data generated from a known piecewise-constant doping profile in which the NUTS sampler fails to produce effective sample sizes at least ten times larger than those from the pCN sampler, or in which small perturbations of the data yield posteriors whose Hellinger distance exceeds the stability bound proved in the paper.
Figures
read the original abstract
We formulate a Bayesian framework for reconstructing doping profiles in pn-junction semiconductor devices from boundary flux measurements. The unknown doping field is modeled as a piecewise-constant function characterized by an unknown interface and two plateau concentrations, leading to a nonlinear ill-posed inverse problem governed by a Poisson-Boltzmann-type equation. To represent this structure while enabling efficient gradient-based inference, we introduce a pushforward prior constructed by mapping a latent Gaussian field with Mat\'ern-type covariance through a sigmoid transformation. The latent field is parameterized by a truncated Karhunen-Lo\`eve expansion, while the two piecewise-constant levels are represented by scalar plateau parameters. The prior yields differentiable approximations of piecewise-constant fields with controllable interface sharpness. We establish well-posedness of the Bayesian formulation by proving Lipschitz continuity of the forward map and Hellinger stability of the posterior. We then sample the posterior using the No-U-Turn Sampler (NUTS) with gradients computed by the adjoint method. Numerical experiments show that the combination of the proposed prior and NUTS provides more efficient posterior exploration than the dimension-robust preconditioned Crank-Nicolson (pCN) sampler, yielding one to two orders of magnitude larger effective sample sizes. In the known-plateau setting, the method reconstructs both planar and curved interfaces and provides spatially resolved uncertainty quantification (UQ). When the interface geometry and plateau concentrations are inferred jointly, posterior correlations reveal structural non-identifiability. These results demonstrate the effectiveness of combining pushforward priors with adjoint-gradient-based sampling for reliable UQ in nonlinear partial differential equation-constrained inverse problems with sharp interfaces.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper formulates a Bayesian inverse problem for reconstructing piecewise-constant doping profiles (unknown interface plus two plateau values) in pn-junction devices from boundary flux data governed by a nonlinear Poisson-Boltzmann equation. A pushforward prior is constructed by applying a sigmoid transformation to a latent Matérn Gaussian random field parameterized via truncated KL expansion, yielding differentiable approximations to sharp interfaces. Well-posedness is established by proving Lipschitz continuity of the forward map and Hellinger stability of the posterior; posterior sampling is performed with adjoint-enabled NUTS, which is reported to produce 1–2 orders of magnitude higher effective sample sizes than pCN. Numerical results include interface reconstruction, spatially resolved UQ, and detection of structural non-identifiability when plateaus and geometry are inferred jointly.
Significance. If the Lipschitz and Hellinger arguments hold and the reported ESS gains are reproducible, the work supplies a concrete, gradient-enabled workflow for Bayesian UQ on nonlinear PDE-constrained problems that possess sharp-interface structure. The explicit reporting of non-identifiability and the use of a structurally informed pushforward prior are positive contributions that can be reused in related imaging or materials inverse problems.
minor comments (3)
- The abstract and introduction should state the specific KL truncation rank and Matérn parameters employed in the numerical examples so that the efficiency comparison can be reproduced without consulting the full text.
- Notation for the plateau scalars and the sigmoid sharpness parameter should be introduced once and used consistently; occasional reuse of the same symbol for different quantities appears in the provided abstract.
- A short paragraph comparing the computational cost of the adjoint gradient evaluation versus finite-difference checks would strengthen the claim that the method is practical for higher-dimensional KL expansions.
Simulated Author's Rebuttal
We thank the referee for the positive assessment of the manuscript, the recognition of the pushforward prior construction, adjoint-enabled NUTS sampling, and the explicit treatment of non-identifiability. The recommendation for minor revision is appreciated. No major comments were raised in the report.
Circularity Check
No significant circularity identified
full rationale
The central claims rest on explicit modeling choices (piecewise-constant doping via pushforward of Matérn field through sigmoid) and direct mathematical proofs (Lipschitz continuity of the forward map from doping to boundary flux, Hellinger stability of the posterior) that invoke standard PDE estimates for the nonlinear Poisson-Boltzmann equation under bounded coefficients. Efficiency comparisons are obtained from numerical experiments with NUTS versus pCN, not by construction from fitted parameters. The truncated KL parameterization and plateau scalars are stated modeling decisions whose consequences (including reported non-identifiability) are exhibited rather than assumed away. No load-bearing step reduces by the paper's own equations or self-citation to a tautological input; the derivation chain is self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (3)
- plateau concentrations
- KL truncation level
- Matérn covariance parameters
axioms (2)
- domain assumption The forward map defined by the Poisson-Boltzmann-type equation is Lipschitz continuous with respect to the doping field.
- domain assumption The posterior measure is stable in the Hellinger metric under perturbations of the data.
invented entities (1)
-
pushforward prior via sigmoid transformation of latent Matérn field
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Cambridge University Press, 2011
David Esseni, Pierpaolo Palestri, and Luca Selmi.Nanoscale MOS transistors: semi- classical transport and applications. Cambridge University Press, 2011
2011
-
[2]
Springer, 2006
Mark S Lundstrom and Jing Guo.Nanoscale transistors: device physics, modeling and simulation. Springer, 2006
2006
-
[3]
The Theory of p-n Junctions in Semiconductors and p-n Junction Tran- sistors.Bell system technical journal, 28(3):435–489, 1949
William Shockley. The Theory of p-n Junctions in Semiconductors and p-n Junction Tran- sistors.Bell system technical journal, 28(3):435–489, 1949
1949
-
[4]
p-n Junction Transistors.Physical Review, 83(1):151, 1951
William Shockley, Morgan Sparks, and Gordon K Teal. p-n Junction Transistors.Physical Review, 83(1):151, 1951
1951
-
[5]
Doping profile analysis in Si by electrochemical capacitance-voltage measurements.Journal of the Electrochemical Society, 142(2):576–580, 1995
E Peiner, A Schlachetzki, and D Kr¨ uger. Doping profile analysis in Si by electrochemical capacitance-voltage measurements.Journal of the Electrochemical Society, 142(2):576–580, 1995
1995
-
[6]
E. F. Schubert, editor.Delta-Doping of Semiconductors. Cambridge University Press, Cambridge, 1996
1996
-
[7]
Identification of doping profiles in semiconductor devices.Inverse Problems, 17(6):1765–1795, 2001
Martin Burger, Heinz W Engl, Peter A Markowich, and Paola Pietra. Identification of doping profiles in semiconductor devices.Inverse Problems, 17(6):1765–1795, 2001
2001
-
[8]
Recovering doping profiles in semicon- ductor devices with the Boltzmann–Poisson model.Journal of Computational Physics, 230(9):3391–3412, 2011
Yingda Cheng, Irene M Gamba, and Kui Ren. Recovering doping profiles in semicon- ductor devices with the Boltzmann–Poisson model.Journal of Computational Physics, 230(9):3391–3412, 2011
2011
-
[9]
Semiconductors and Dirichlet-to-Neumann maps.Computational & Ap- plied Mathematics, 25(2-3):187–203, 2006
Antonio Leit˜ ao. Semiconductors and Dirichlet-to-Neumann maps.Computational & Ap- plied Mathematics, 25(2-3):187–203, 2006
2006
-
[10]
Springer Science & Business Media, 2012
Peter A Markowich, Christian A Ringhofer, and Christian Schmeiser.Semiconductor equa- tions. Springer Science & Business Media, 2012
2012
-
[11]
Springer Science & Business Media, 1985
Peter A Markowich.The stationary semiconductor device equations. Springer Science & Business Media, 1985
1985
-
[12]
Springer Science & Business Media, 1984
Siegfried Selberherr.Analysis and simulation of semiconductor devices. Springer Science & Business Media, 1984
1984
-
[13]
Springer, 2009
Ansgar J¨ ungel.Transport equations for semiconductors. Springer, 2009
2009
-
[14]
The optimal multilevel Monte-Carlo approximation of the stochastic drift–diffusion-Poisson system.Computer Methods in Applied Mechanics and Engineering, 318:739–761, 2017
Leila Taghizadeh, Amirreza Khodadadian, and Clemens Heitzinger. The optimal multilevel Monte-Carlo approximation of the stochastic drift–diffusion-Poisson system.Computer Methods in Applied Mechanics and Engineering, 318:739–761, 2017
2017
-
[15]
Optimal multilevel randomized quasi-Monte-Carlo method for the stochastic drift–diffusion-Poisson system
Amirreza Khodadadian, Leila Taghizadeh, and Clemens Heitzinger. Optimal multilevel randomized quasi-Monte-Carlo method for the stochastic drift–diffusion-Poisson system. Computer Methods in Applied Mechanics and Engineering, 329:480–497, 2018
2018
-
[16]
Bayesian estimation of physical and geo- metrical parameters for nanocapacitor array biosensors.Journal of Computational Physics, 397:108874, 2019
Benjamin Stadlbauer, Andrea Cossettini, Daniel Pasterk, Paolo Scarbolo, Leila Taghizadeh, Clemens Heitzinger, Luca Selmi, et al. Bayesian estimation of physical and geo- metrical parameters for nanocapacitor array biosensors.Journal of Computational Physics, 397:108874, 2019
2019
-
[17]
Bayesian inversion for the identification of the dop- ing profile in unipolar semiconductor devices.SIAM Journal on Scientific Computing, 47(3):B690–B709, 2025
Leila Taghizadeh and Ansgar J¨ ungel. Bayesian inversion for the identification of the dop- ing profile in unipolar semiconductor devices.SIAM Journal on Scientific Computing, 47(3):B690–B709, 2025. 27
2025
-
[18]
Determination of micro-and nano-particle properties by multi-frequency Bayesian methods and applications to nanoelectrode array sensors
Andrea Cossettini, Benjamin Stadlbauer, Jose Morales Escalante, Leila Taghizadeh, Luca Selmi, and Clemens Heitzinger. Determination of micro-and nano-particle properties by multi-frequency Bayesian methods and applications to nanoelectrode array sensors. In2019 IEEE SENSORS, pages 1–4. IEEE, 2019
2019
-
[19]
Inverse problems: a Bayesian perspective.Acta numerica, 19:451–559, 2010
Andrew M Stuart. Inverse problems: a Bayesian perspective.Acta numerica, 19:451–559, 2010
2010
-
[20]
Springer, 2005
Jari P Kaipio and Erkki Somersalo.Statistical and computational inverse problems. Springer, 2005
2005
-
[21]
SIAM, 2024
Ralph C Smith.Uncertainty quantification: theory, implementation, and applications. SIAM, 2024
2024
-
[22]
A stochastic Newton MCMC method for large-scale statistical inverse problems with application to seismic inversion.SIAM Journal on Scientific Computing, 34(3):A1460–A1487, 2012
James Martin, Lucas C Wilcox, Carsten Burstedde, and Omar Ghattas. A stochastic Newton MCMC method for large-scale statistical inverse problems with application to seismic inversion.SIAM Journal on Scientific Computing, 34(3):A1460–A1487, 2012
2012
-
[23]
Solving large-scale PDE-constrained Bayesian in- verse problems with Riemann manifold Hamiltonian Monte Carlo.Inverse Problems, 30(11):114014, 2014
Tan Bui-Thanh and Mark Girolami. Solving large-scale PDE-constrained Bayesian in- verse problems with Riemann manifold Hamiltonian Monte Carlo.Inverse Problems, 30(11):114014, 2014
2014
-
[24]
Geometric MCMC for infinite-dimensional inverse problems.Journal of Computational Physics, 335:327–351, 2017
Alexandros Beskos, Mark Girolami, Shiwei Lan, Patrick E Farrell, and Andrew M Stuart. Geometric MCMC for infinite-dimensional inverse problems.Journal of Computational Physics, 335:327–351, 2017
2017
-
[25]
Springer Science & Business Media, 2008
Michael Hinze, Ren´ e Pinnau, Michael Ulbrich, and Stefan Ulbrich.Optimization with PDE constraints. Springer Science & Business Media, 2008
2008
-
[26]
FEM-based discretization-invariant MCMC methods for PDE-constrained Bayesian inverse problems.Inverse Probl
Tan Bui-Thanh and Quoc P Nguyen. FEM-based discretization-invariant MCMC methods for PDE-constrained Bayesian inverse problems.Inverse Probl. Imaging, 10(4):943–975, 2016
2016
-
[27]
Hamiltonian Monte Carlo solution of tomographic inverse problems.Geophysical Journal International, 216(2):1344–1363, 2019
Andreas Fichtner, Andrea Zunino, and Lars Gebraad. Hamiltonian Monte Carlo solution of tomographic inverse problems.Geophysical Journal International, 216(2):1344–1363, 2019
2019
-
[28]
Michael C Koch, Kazunori Fujisawa, and Akira Murakami. Adjoint Hamiltonian Monte Carlo algorithm for the estimation of elastic modulus through the inversion of elastic wave propagation data.International Journal for Numerical Methods in Engineering, 121(6):1037–1067, 2020
2020
-
[29]
HMCLab: a framework for solving diverse geophysical inverse problems using the Hamiltonian Monte Carlo method.Geophysical Journal International, 235(3):2979–2991, 2023
Andrea Zunino, Lars Gebraad, Alessandro Ghirotto, and Andreas Fichtner. HMCLab: a framework for solving diverse geophysical inverse problems using the Hamiltonian Monte Carlo method.Geophysical Journal International, 235(3):2979–2991, 2023
2023
-
[30]
Horseshoe priors for edge-preserving linear Bayesian inversion.SIAM Journal on Scientific Computing, 45(3):B337–B365, 2023
Felipe Uribe, Yiqiu Dong, and Per Christian Hansen. Horseshoe priors for edge-preserving linear Bayesian inversion.SIAM Journal on Scientific Computing, 45(3):B337–B365, 2023
2023
-
[31]
The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo.J
Matthew D Hoffman, Andrew Gelman, et al. The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo.J. Mach. Learn. Res., 15(1):1593–1623, 2014
2014
-
[32]
Roberts and Richard L
Gareth O. Roberts and Richard L. Tweedie. Exponential convergence of Langevin distri- butions and their discrete approximations.Bernoulli, 2(4):341 – 363, 1996
1996
-
[33]
Optimal scaling of discrete approximations to Langevin diffusions.Journal of the Royal Statistical Society: Series B (Statistical Method- ology), 60(1):255–268, 1998
Gareth O Roberts and Jeffrey S Rosenthal. Optimal scaling of discrete approximations to Langevin diffusions.Journal of the Royal Statistical Society: Series B (Statistical Method- ology), 60(1):255–268, 1998. 28
1998
-
[34]
MCMC methods for functions: modifying old algorithms to make them faster.Statistical Science, pages 424–446, 2013
Simon L Cotter, Gareth O Roberts, Andrew M Stuart, and David White. MCMC methods for functions: modifying old algorithms to make them faster.Statistical Science, pages 424–446, 2013
2013
-
[35]
On inverse prob- lems for semiconductor equations.Milan Journal of Mathematics, 72(1):273–313, 2004
Martin Burger, Heinz W Engl, Antonio Leit˜ ao, and Peter A Markowich. On inverse prob- lems for semiconductor equations.Milan Journal of Mathematics, 72(1):273–313, 2004
2004
-
[36]
Springer Science & Business Media, 2006
Alfio Quarteroni, Riccardo Sacco, and Fausto Saleri.Numerical mathematics, volume 37. Springer Science & Business Media, 2006
2006
-
[37]
Springer Science & Business Media, 2012
Ildar Abdulovich Ibragimov and Yurii Antol’evich Rozanov.Gaussian random processes. Springer Science & Business Media, 2012
2012
-
[38]
Hierarchical Bayesian level set inversion.Statistics and Computing, 27(6):1555–1584, 2017
Matthew M Dunlop, Marco A Iglesias, and Andrew M Stuart. Hierarchical Bayesian level set inversion.Statistics and Computing, 27(6):1555–1584, 2017
2017
-
[39]
On stationary processes in the plane.Biometrika, 41(3/4):434–449, 1954
Peter Whittle. On stationary processes in the plane.Biometrika, 41(3/4):434–449, 1954
1954
-
[40]
G. Ali, I. Torcicollo, and S. Vessella. Inverse doping problems for a P-N junction.Journal of Numerical Mathematics, 14(6):537–546, 2006
2006
-
[41]
McGraw-Hill, New York, 3rd edition, 1987
Walter Rudin.Real and Complex Analysis. McGraw-Hill, New York, 3rd edition, 1987
1987
-
[42]
Springer Science & Business Media, 2000
Lucien Le Cam and Grace Lo Yang.Asymptotics in statistics: some basic concepts. Springer Science & Business Media, 2000
2000
-
[43]
Image quality assess- ment: from error visibility to structural similarity.IEEE transactions on image processing, 13(4):600–612, 2004
Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. Image quality assess- ment: from error visibility to structural similarity.IEEE transactions on image processing, 13(4):600–612, 2004
2004
-
[44]
Owen.Monte Carlo Theory, Methods and Examples.https://artowen.su
Art B. Owen.Monte Carlo Theory, Methods and Examples.https://artowen.su. domains/mc/, 2013. Chapter 11. 29
2013
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.