pith. sign in

arxiv: 2606.30340 · v1 · pith:G2CN5ERHnew · submitted 2026-06-29 · 🧮 math.NA · cs.NA· math.AP

Adjoint-Based Bayesian Uncertainty Quantification for PDE-Constrained Inverse Problems with Application to Semiconductor Imaging

Pith reviewed 2026-06-30 04:59 UTC · model grok-4.3

classification 🧮 math.NA cs.NAmath.AP
keywords Bayesian inverse problemsPDE-constrained inferencesemiconductor doping reconstructionpushforward prioradjoint methodNo-U-Turn Sampleruncertainty quantificationpiecewise-constant fields
0
0 comments X

The pith

A pushforward prior from a latent Matérn Gaussian field through a sigmoid, paired with adjoint NUTS sampling, reconstructs piecewise-constant doping interfaces in semiconductors while quantifying uncertainty.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper formulates a Bayesian inverse problem for recovering doping profiles in pn-junction devices from boundary flux data, where the unknown is a piecewise-constant field with an unknown interface and two plateau values. It builds a pushforward prior by transforming a truncated Karhunen-Loève expansion of a Matérn Gaussian field via a sigmoid to produce differentiable approximations of sharp interfaces. Well-posedness is established by proving the forward Poisson-Boltzmann map is Lipschitz continuous and the posterior is stable in the Hellinger metric. Posterior sampling is performed with the No-U-Turn Sampler using gradients from the adjoint method, and experiments demonstrate one to two orders of magnitude improvement in effective sample size over preconditioned Crank-Nicolson. Reconstructions are shown for both known and jointly inferred plateaus, with the latter exposing posterior correlations that indicate non-identifiability.

Core claim

The central claim is that mapping a latent Gaussian field with Matérn covariance through a sigmoid produces a usable pushforward prior for piecewise-constant doping fields, and that this prior combined with adjoint-computed gradients enables the No-U-Turn Sampler to explore the posterior of the nonlinear PDE-constrained inverse problem more efficiently than dimension-robust alternatives while satisfying the well-posedness conditions of forward-map Lipschitz continuity and Hellinger posterior stability.

What carries the argument

The pushforward prior that applies a sigmoid transformation to a truncated Karhunen-Loève expansion of a Matérn Gaussian random field to generate differentiable approximations of piecewise-constant doping profiles with controllable interface sharpness.

If this is right

  • In the known-plateau setting the method recovers both planar and curved interfaces together with spatially resolved uncertainty maps.
  • When interface geometry and plateau concentrations are inferred jointly, posterior correlations reveal structural non-identifiability.
  • The combination of the proposed prior and NUTS produces effective sample sizes one to two orders of magnitude larger than those obtained with the preconditioned Crank-Nicolson sampler.
  • The Bayesian formulation satisfies well-posedness through Lipschitz continuity of the forward map and Hellinger stability of the posterior.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The observed non-identifiability when plateaus are unknown indicates that supplementary measurements or explicit constraints on concentration values would be needed for unique recovery.
  • Because the adjoint supplies gradients at essentially the cost of one forward solve, the sampling approach remains computationally feasible when the number of KL modes or the spatial resolution is increased.
  • The same sigmoid-pushforward construction could be transferred to other PDE inverse problems that require reconstruction of sharp material interfaces while preserving differentiability for gradient-based inference.

Load-bearing premise

The unknown doping field is assumed to be exactly piecewise constant with a single interface separating two constant plateau concentrations.

What would settle it

A numerical experiment on synthetic boundary flux data generated from a known piecewise-constant doping profile in which the NUTS sampler fails to produce effective sample sizes at least ten times larger than those from the pCN sampler, or in which small perturbations of the data yield posteriors whose Hellinger distance exceeds the stability bound proved in the paper.

Figures

Figures reproduced from arXiv: 2606.30340 by Babak Maboudi Afkham, Hassan Yazdanian, Leila Taghizadeh.

Figure 1
Figure 1. Figure 1: A two-dimensional pn-junction with a piecewise-constant doping function [PITH_FULL_IMAGE:figures/full_fig_p005_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Dependency chart for finite-dimensional parameters in the semiconductor inverse [PITH_FULL_IMAGE:figures/full_fig_p012_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Representative realizations from the pushforward Whittle–Mat´ern prior for the doping [PITH_FULL_IMAGE:figures/full_fig_p016_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Reconstruction results for the known-plateau setting with a straight interface (Ex [PITH_FULL_IMAGE:figures/full_fig_p017_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Trace plots of five representative KL coefficients for the reconstruction problem with [PITH_FULL_IMAGE:figures/full_fig_p018_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Reconstruction results for the known-plateau setting with a curved interface (Ex [PITH_FULL_IMAGE:figures/full_fig_p019_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Joint reconstruction results. (a) Posterior mean and (b) posterior standard deviation [PITH_FULL_IMAGE:figures/full_fig_p021_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Trace plots for representative KL coefficients in the joint reconstruction experiment. [PITH_FULL_IMAGE:figures/full_fig_p022_8.png] view at source ↗
read the original abstract

We formulate a Bayesian framework for reconstructing doping profiles in pn-junction semiconductor devices from boundary flux measurements. The unknown doping field is modeled as a piecewise-constant function characterized by an unknown interface and two plateau concentrations, leading to a nonlinear ill-posed inverse problem governed by a Poisson-Boltzmann-type equation. To represent this structure while enabling efficient gradient-based inference, we introduce a pushforward prior constructed by mapping a latent Gaussian field with Mat\'ern-type covariance through a sigmoid transformation. The latent field is parameterized by a truncated Karhunen-Lo\`eve expansion, while the two piecewise-constant levels are represented by scalar plateau parameters. The prior yields differentiable approximations of piecewise-constant fields with controllable interface sharpness. We establish well-posedness of the Bayesian formulation by proving Lipschitz continuity of the forward map and Hellinger stability of the posterior. We then sample the posterior using the No-U-Turn Sampler (NUTS) with gradients computed by the adjoint method. Numerical experiments show that the combination of the proposed prior and NUTS provides more efficient posterior exploration than the dimension-robust preconditioned Crank-Nicolson (pCN) sampler, yielding one to two orders of magnitude larger effective sample sizes. In the known-plateau setting, the method reconstructs both planar and curved interfaces and provides spatially resolved uncertainty quantification (UQ). When the interface geometry and plateau concentrations are inferred jointly, posterior correlations reveal structural non-identifiability. These results demonstrate the effectiveness of combining pushforward priors with adjoint-gradient-based sampling for reliable UQ in nonlinear partial differential equation-constrained inverse problems with sharp interfaces.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 3 minor

Summary. The paper formulates a Bayesian inverse problem for reconstructing piecewise-constant doping profiles (unknown interface plus two plateau values) in pn-junction devices from boundary flux data governed by a nonlinear Poisson-Boltzmann equation. A pushforward prior is constructed by applying a sigmoid transformation to a latent Matérn Gaussian random field parameterized via truncated KL expansion, yielding differentiable approximations to sharp interfaces. Well-posedness is established by proving Lipschitz continuity of the forward map and Hellinger stability of the posterior; posterior sampling is performed with adjoint-enabled NUTS, which is reported to produce 1–2 orders of magnitude higher effective sample sizes than pCN. Numerical results include interface reconstruction, spatially resolved UQ, and detection of structural non-identifiability when plateaus and geometry are inferred jointly.

Significance. If the Lipschitz and Hellinger arguments hold and the reported ESS gains are reproducible, the work supplies a concrete, gradient-enabled workflow for Bayesian UQ on nonlinear PDE-constrained problems that possess sharp-interface structure. The explicit reporting of non-identifiability and the use of a structurally informed pushforward prior are positive contributions that can be reused in related imaging or materials inverse problems.

minor comments (3)
  1. The abstract and introduction should state the specific KL truncation rank and Matérn parameters employed in the numerical examples so that the efficiency comparison can be reproduced without consulting the full text.
  2. Notation for the plateau scalars and the sigmoid sharpness parameter should be introduced once and used consistently; occasional reuse of the same symbol for different quantities appears in the provided abstract.
  3. A short paragraph comparing the computational cost of the adjoint gradient evaluation versus finite-difference checks would strengthen the claim that the method is practical for higher-dimensional KL expansions.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of the manuscript, the recognition of the pushforward prior construction, adjoint-enabled NUTS sampling, and the explicit treatment of non-identifiability. The recommendation for minor revision is appreciated. No major comments were raised in the report.

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The central claims rest on explicit modeling choices (piecewise-constant doping via pushforward of Matérn field through sigmoid) and direct mathematical proofs (Lipschitz continuity of the forward map from doping to boundary flux, Hellinger stability of the posterior) that invoke standard PDE estimates for the nonlinear Poisson-Boltzmann equation under bounded coefficients. Efficiency comparisons are obtained from numerical experiments with NUTS versus pCN, not by construction from fitted parameters. The truncated KL parameterization and plateau scalars are stated modeling decisions whose consequences (including reported non-identifiability) are exhibited rather than assumed away. No load-bearing step reduces by the paper's own equations or self-citation to a tautological input; the derivation chain is self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

3 free parameters · 2 axioms · 1 invented entities

The central claims rest on the piecewise-constant doping model, the differentiability properties of the sigmoid pushforward, the Lipschitz and Hellinger stability proofs, and the numerical comparison of samplers; these elements are introduced or asserted within the paper rather than taken from independent external benchmarks.

free parameters (3)
  • plateau concentrations
    Two scalar parameters representing the constant doping levels in each region of the piecewise-constant model.
  • KL truncation level
    Number of terms retained in the truncated Karhunen-Loève expansion of the latent Gaussian field.
  • Matérn covariance parameters
    Parameters controlling correlation length and smoothness of the latent field before the sigmoid transformation.
axioms (2)
  • domain assumption The forward map defined by the Poisson-Boltzmann-type equation is Lipschitz continuous with respect to the doping field.
    Invoked to establish well-posedness of the Bayesian formulation.
  • domain assumption The posterior measure is stable in the Hellinger metric under perturbations of the data.
    Proven as part of the well-posedness result.
invented entities (1)
  • pushforward prior via sigmoid transformation of latent Matérn field no independent evidence
    purpose: To produce differentiable approximations to piecewise-constant fields with controllable interface sharpness for use inside gradient-based sampling.
    New construction introduced to represent the structured doping field while remaining compatible with NUTS.

pith-pipeline@v0.9.1-grok · 5836 in / 1726 out tokens · 60868 ms · 2026-06-30T04:59:34.388089+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

44 extracted references

  1. [1]

    Cambridge University Press, 2011

    David Esseni, Pierpaolo Palestri, and Luca Selmi.Nanoscale MOS transistors: semi- classical transport and applications. Cambridge University Press, 2011

  2. [2]

    Springer, 2006

    Mark S Lundstrom and Jing Guo.Nanoscale transistors: device physics, modeling and simulation. Springer, 2006

  3. [3]

    The Theory of p-n Junctions in Semiconductors and p-n Junction Tran- sistors.Bell system technical journal, 28(3):435–489, 1949

    William Shockley. The Theory of p-n Junctions in Semiconductors and p-n Junction Tran- sistors.Bell system technical journal, 28(3):435–489, 1949

  4. [4]

    p-n Junction Transistors.Physical Review, 83(1):151, 1951

    William Shockley, Morgan Sparks, and Gordon K Teal. p-n Junction Transistors.Physical Review, 83(1):151, 1951

  5. [5]

    Doping profile analysis in Si by electrochemical capacitance-voltage measurements.Journal of the Electrochemical Society, 142(2):576–580, 1995

    E Peiner, A Schlachetzki, and D Kr¨ uger. Doping profile analysis in Si by electrochemical capacitance-voltage measurements.Journal of the Electrochemical Society, 142(2):576–580, 1995

  6. [6]

    E. F. Schubert, editor.Delta-Doping of Semiconductors. Cambridge University Press, Cambridge, 1996

  7. [7]

    Identification of doping profiles in semiconductor devices.Inverse Problems, 17(6):1765–1795, 2001

    Martin Burger, Heinz W Engl, Peter A Markowich, and Paola Pietra. Identification of doping profiles in semiconductor devices.Inverse Problems, 17(6):1765–1795, 2001

  8. [8]

    Recovering doping profiles in semicon- ductor devices with the Boltzmann–Poisson model.Journal of Computational Physics, 230(9):3391–3412, 2011

    Yingda Cheng, Irene M Gamba, and Kui Ren. Recovering doping profiles in semicon- ductor devices with the Boltzmann–Poisson model.Journal of Computational Physics, 230(9):3391–3412, 2011

  9. [9]

    Semiconductors and Dirichlet-to-Neumann maps.Computational & Ap- plied Mathematics, 25(2-3):187–203, 2006

    Antonio Leit˜ ao. Semiconductors and Dirichlet-to-Neumann maps.Computational & Ap- plied Mathematics, 25(2-3):187–203, 2006

  10. [10]

    Springer Science & Business Media, 2012

    Peter A Markowich, Christian A Ringhofer, and Christian Schmeiser.Semiconductor equa- tions. Springer Science & Business Media, 2012

  11. [11]

    Springer Science & Business Media, 1985

    Peter A Markowich.The stationary semiconductor device equations. Springer Science & Business Media, 1985

  12. [12]

    Springer Science & Business Media, 1984

    Siegfried Selberherr.Analysis and simulation of semiconductor devices. Springer Science & Business Media, 1984

  13. [13]

    Springer, 2009

    Ansgar J¨ ungel.Transport equations for semiconductors. Springer, 2009

  14. [14]

    The optimal multilevel Monte-Carlo approximation of the stochastic drift–diffusion-Poisson system.Computer Methods in Applied Mechanics and Engineering, 318:739–761, 2017

    Leila Taghizadeh, Amirreza Khodadadian, and Clemens Heitzinger. The optimal multilevel Monte-Carlo approximation of the stochastic drift–diffusion-Poisson system.Computer Methods in Applied Mechanics and Engineering, 318:739–761, 2017

  15. [15]

    Optimal multilevel randomized quasi-Monte-Carlo method for the stochastic drift–diffusion-Poisson system

    Amirreza Khodadadian, Leila Taghizadeh, and Clemens Heitzinger. Optimal multilevel randomized quasi-Monte-Carlo method for the stochastic drift–diffusion-Poisson system. Computer Methods in Applied Mechanics and Engineering, 329:480–497, 2018

  16. [16]

    Bayesian estimation of physical and geo- metrical parameters for nanocapacitor array biosensors.Journal of Computational Physics, 397:108874, 2019

    Benjamin Stadlbauer, Andrea Cossettini, Daniel Pasterk, Paolo Scarbolo, Leila Taghizadeh, Clemens Heitzinger, Luca Selmi, et al. Bayesian estimation of physical and geo- metrical parameters for nanocapacitor array biosensors.Journal of Computational Physics, 397:108874, 2019

  17. [17]

    Bayesian inversion for the identification of the dop- ing profile in unipolar semiconductor devices.SIAM Journal on Scientific Computing, 47(3):B690–B709, 2025

    Leila Taghizadeh and Ansgar J¨ ungel. Bayesian inversion for the identification of the dop- ing profile in unipolar semiconductor devices.SIAM Journal on Scientific Computing, 47(3):B690–B709, 2025. 27

  18. [18]

    Determination of micro-and nano-particle properties by multi-frequency Bayesian methods and applications to nanoelectrode array sensors

    Andrea Cossettini, Benjamin Stadlbauer, Jose Morales Escalante, Leila Taghizadeh, Luca Selmi, and Clemens Heitzinger. Determination of micro-and nano-particle properties by multi-frequency Bayesian methods and applications to nanoelectrode array sensors. In2019 IEEE SENSORS, pages 1–4. IEEE, 2019

  19. [19]

    Inverse problems: a Bayesian perspective.Acta numerica, 19:451–559, 2010

    Andrew M Stuart. Inverse problems: a Bayesian perspective.Acta numerica, 19:451–559, 2010

  20. [20]

    Springer, 2005

    Jari P Kaipio and Erkki Somersalo.Statistical and computational inverse problems. Springer, 2005

  21. [21]

    SIAM, 2024

    Ralph C Smith.Uncertainty quantification: theory, implementation, and applications. SIAM, 2024

  22. [22]

    A stochastic Newton MCMC method for large-scale statistical inverse problems with application to seismic inversion.SIAM Journal on Scientific Computing, 34(3):A1460–A1487, 2012

    James Martin, Lucas C Wilcox, Carsten Burstedde, and Omar Ghattas. A stochastic Newton MCMC method for large-scale statistical inverse problems with application to seismic inversion.SIAM Journal on Scientific Computing, 34(3):A1460–A1487, 2012

  23. [23]

    Solving large-scale PDE-constrained Bayesian in- verse problems with Riemann manifold Hamiltonian Monte Carlo.Inverse Problems, 30(11):114014, 2014

    Tan Bui-Thanh and Mark Girolami. Solving large-scale PDE-constrained Bayesian in- verse problems with Riemann manifold Hamiltonian Monte Carlo.Inverse Problems, 30(11):114014, 2014

  24. [24]

    Geometric MCMC for infinite-dimensional inverse problems.Journal of Computational Physics, 335:327–351, 2017

    Alexandros Beskos, Mark Girolami, Shiwei Lan, Patrick E Farrell, and Andrew M Stuart. Geometric MCMC for infinite-dimensional inverse problems.Journal of Computational Physics, 335:327–351, 2017

  25. [25]

    Springer Science & Business Media, 2008

    Michael Hinze, Ren´ e Pinnau, Michael Ulbrich, and Stefan Ulbrich.Optimization with PDE constraints. Springer Science & Business Media, 2008

  26. [26]

    FEM-based discretization-invariant MCMC methods for PDE-constrained Bayesian inverse problems.Inverse Probl

    Tan Bui-Thanh and Quoc P Nguyen. FEM-based discretization-invariant MCMC methods for PDE-constrained Bayesian inverse problems.Inverse Probl. Imaging, 10(4):943–975, 2016

  27. [27]

    Hamiltonian Monte Carlo solution of tomographic inverse problems.Geophysical Journal International, 216(2):1344–1363, 2019

    Andreas Fichtner, Andrea Zunino, and Lars Gebraad. Hamiltonian Monte Carlo solution of tomographic inverse problems.Geophysical Journal International, 216(2):1344–1363, 2019

  28. [28]

    Michael C Koch, Kazunori Fujisawa, and Akira Murakami. Adjoint Hamiltonian Monte Carlo algorithm for the estimation of elastic modulus through the inversion of elastic wave propagation data.International Journal for Numerical Methods in Engineering, 121(6):1037–1067, 2020

  29. [29]

    HMCLab: a framework for solving diverse geophysical inverse problems using the Hamiltonian Monte Carlo method.Geophysical Journal International, 235(3):2979–2991, 2023

    Andrea Zunino, Lars Gebraad, Alessandro Ghirotto, and Andreas Fichtner. HMCLab: a framework for solving diverse geophysical inverse problems using the Hamiltonian Monte Carlo method.Geophysical Journal International, 235(3):2979–2991, 2023

  30. [30]

    Horseshoe priors for edge-preserving linear Bayesian inversion.SIAM Journal on Scientific Computing, 45(3):B337–B365, 2023

    Felipe Uribe, Yiqiu Dong, and Per Christian Hansen. Horseshoe priors for edge-preserving linear Bayesian inversion.SIAM Journal on Scientific Computing, 45(3):B337–B365, 2023

  31. [31]

    The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo.J

    Matthew D Hoffman, Andrew Gelman, et al. The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo.J. Mach. Learn. Res., 15(1):1593–1623, 2014

  32. [32]

    Roberts and Richard L

    Gareth O. Roberts and Richard L. Tweedie. Exponential convergence of Langevin distri- butions and their discrete approximations.Bernoulli, 2(4):341 – 363, 1996

  33. [33]

    Optimal scaling of discrete approximations to Langevin diffusions.Journal of the Royal Statistical Society: Series B (Statistical Method- ology), 60(1):255–268, 1998

    Gareth O Roberts and Jeffrey S Rosenthal. Optimal scaling of discrete approximations to Langevin diffusions.Journal of the Royal Statistical Society: Series B (Statistical Method- ology), 60(1):255–268, 1998. 28

  34. [34]

    MCMC methods for functions: modifying old algorithms to make them faster.Statistical Science, pages 424–446, 2013

    Simon L Cotter, Gareth O Roberts, Andrew M Stuart, and David White. MCMC methods for functions: modifying old algorithms to make them faster.Statistical Science, pages 424–446, 2013

  35. [35]

    On inverse prob- lems for semiconductor equations.Milan Journal of Mathematics, 72(1):273–313, 2004

    Martin Burger, Heinz W Engl, Antonio Leit˜ ao, and Peter A Markowich. On inverse prob- lems for semiconductor equations.Milan Journal of Mathematics, 72(1):273–313, 2004

  36. [36]

    Springer Science & Business Media, 2006

    Alfio Quarteroni, Riccardo Sacco, and Fausto Saleri.Numerical mathematics, volume 37. Springer Science & Business Media, 2006

  37. [37]

    Springer Science & Business Media, 2012

    Ildar Abdulovich Ibragimov and Yurii Antol’evich Rozanov.Gaussian random processes. Springer Science & Business Media, 2012

  38. [38]

    Hierarchical Bayesian level set inversion.Statistics and Computing, 27(6):1555–1584, 2017

    Matthew M Dunlop, Marco A Iglesias, and Andrew M Stuart. Hierarchical Bayesian level set inversion.Statistics and Computing, 27(6):1555–1584, 2017

  39. [39]

    On stationary processes in the plane.Biometrika, 41(3/4):434–449, 1954

    Peter Whittle. On stationary processes in the plane.Biometrika, 41(3/4):434–449, 1954

  40. [40]

    G. Ali, I. Torcicollo, and S. Vessella. Inverse doping problems for a P-N junction.Journal of Numerical Mathematics, 14(6):537–546, 2006

  41. [41]

    McGraw-Hill, New York, 3rd edition, 1987

    Walter Rudin.Real and Complex Analysis. McGraw-Hill, New York, 3rd edition, 1987

  42. [42]

    Springer Science & Business Media, 2000

    Lucien Le Cam and Grace Lo Yang.Asymptotics in statistics: some basic concepts. Springer Science & Business Media, 2000

  43. [43]

    Image quality assess- ment: from error visibility to structural similarity.IEEE transactions on image processing, 13(4):600–612, 2004

    Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. Image quality assess- ment: from error visibility to structural similarity.IEEE transactions on image processing, 13(4):600–612, 2004

  44. [44]

    Owen.Monte Carlo Theory, Methods and Examples.https://artowen.su

    Art B. Owen.Monte Carlo Theory, Methods and Examples.https://artowen.su. domains/mc/, 2013. Chapter 11. 29