pith. sign in

arxiv: 2606.08654 · v1 · pith:43OJNU2Lnew · submitted 2026-06-07 · 💻 cs.LG · cs.NA· math.AP· math.NA· stat.AP

Operator learning for the 2D incompressible Navier-Stokes equations: a conformal prediction approach in the data-scarce regime

Pith reviewed 2026-06-27 18:38 UTC · model grok-4.3

classification 💻 cs.LG cs.NAmath.APmath.NAstat.AP
keywords conformal predictionoperator learningNavier-Stokesuncertainty quantificationFourier Neural Operatordata-scarceperturbation
0
0 comments X

The pith

Perturbation-based conformal prediction produces narrower uncertainty bands for neural operators on 2D Navier-Stokes under fixed data budgets.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes wrapping a trained Fourier Neural Operator with split conformal prediction, where the local uncertainty scale comes from comparing predictions of two operators trained on original labels versus labels perturbed by small Gaussian noise. This is examined in the data-scarce regime with a fixed total label budget. On the 2D Navier-Stokes benchmark, it achieves substantially narrower conformal bands than existing methods while maintaining the target simultaneous coverage. The results indicate that perturbation sensitivity can serve as a practical and sample-efficient uncertainty proxy for conformalized neural operators.

Core claim

By training one Fourier Neural Operator on the original dataset and another on labels perturbed by small Gaussian noise, the difference in their predictions provides a local uncertainty scale. When this scale is used within split conformal prediction, the resulting intervals maintain the desired coverage but are narrower than those from competing approaches that require splitting the data budget across multiple models, as demonstrated on the 2D incompressible Navier-Stokes equations.

What carries the argument

The perturbation-based local uncertainty scale derived from the prediction difference between two FNOs trained on nearly identical but differently labeled datasets.

If this is right

  • The perturbation method maintains target simultaneous coverage on the benchmark.
  • It produces substantially narrower conformal bands than existing methods under matched total data budgets.
  • Perturbation sensitivity acts as a sample-efficient uncertainty proxy for neural operators.
  • The approach is suitable for data-scarce regimes where training separate uncertainty networks divides the data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the perturbation method generalizes, it could apply to other PDE operator learning tasks with limited data.
  • The method might allow for better uncertainty quantification without additional model training overhead.
  • Optimal choice of perturbation magnitude could be investigated further for different equations.

Load-bearing premise

Comparing predictions from operators trained on original and perturbed labels yields a valid local uncertainty scale that preserves conformal prediction coverage guarantees without systematic bias.

What would settle it

A test on the 2D Navier-Stokes benchmark where the perturbation-based conformal bands fail to achieve the target coverage level or are wider than those from baseline methods under the same total data budget.

Figures

Figures reproduced from arXiv: 2606.08654 by Bowen Gang, Hao Deng, Weinan Wang.

Figure 1
Figure 1. Figure 1: Qualitative perturbation-based uncertainty map. Left: ground-truth vorticity for a representative test trajectory. Right: conformal radius Q1−ασ(a) derived from the dis￾agreement between the base and perturbed operators [PITH_FULL_IMAGE:figures/full_fig_p011_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Base operator performance check. Left: ground-truth vorticity field for a repre￾sentative test sample. Center: FNO prediction. Right: absolute error [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗
read the original abstract

In this paper, we propose a perturbation-based conformal prediction framework for uncertainty quantification in operator learning, with a focus on the 2D Navier--Stokes equations. While neural operators provide fast surrogates for expensive PDE solvers, they do not by themselves provide calibrated uncertainty for spatiotemporal field predictions. Our approach wraps a trained Fourier Neural Operator (FNO) with split conformal prediction and constructs the local uncertainty scale by comparing the predictions of two operators trained on nearly identical datasets: one on the original labels and one on labels perturbed by small Gaussian noise. We consider this procedure in the data-scarce regime, where the total label budget is fixed and methods that require a separate uncertainty network must divide training data between multiple models. On the 2D Navier--Stokes benchmark, the perturbation-based method produces substantially narrower conformal bands than existing methods under matched total data budgets while maintaining the target simultaneous coverage. These results suggest that perturbation sensitivity is a practical and sample-efficient uncertainty proxy for conformalized neural operators.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes a perturbation-based conformal prediction framework for uncertainty quantification in operator learning for the 2D incompressible Navier-Stokes equations. It wraps a Fourier Neural Operator (FNO) with split conformal prediction, constructing local uncertainty scales by comparing predictions from two FNOs trained on original versus small-Gaussian-perturbed labels. In the data-scarce regime with fixed total label budgets, the method is claimed to yield substantially narrower conformal bands than existing approaches while maintaining target simultaneous coverage on the 2D NS benchmark.

Significance. If the coverage guarantee is preserved, the approach would provide a practical, sample-efficient uncertainty proxy for neural operators that avoids allocating separate data for an uncertainty network, which is valuable for high-dimensional spatiotemporal PDE surrogates where data is expensive. The empirical demonstration on a standard benchmark under matched budgets is a concrete strength.

major comments (2)
  1. [Abstract] Abstract: the central claim that the perturbation-based method maintains the target simultaneous coverage while producing narrower bands lacks any derivation, proof sketch, or experimental controls showing that the perturbation step preserves the conformal guarantee. The nonconformity scores (absolute differences between the two FNO predictions) are constructed from operators trained on nearly identical datasets, which risks violating the exchangeability assumption required for split conformal prediction marginal coverage.
  2. [Method] Method section (description of local scale construction): the local uncertainty scale is defined as the absolute difference between predictions of two FNOs trained on original and perturbed labels; no analysis is provided demonstrating that this scale is independent of the perturbation magnitude or that the resulting scores on calibration points remain exchangeable with test points, which is load-bearing for the coverage claim.
minor comments (2)
  1. [Abstract] The abstract refers to 'simultaneous coverage' without specifying whether this is marginal or joint over the spatiotemporal field; clarify the exact coverage statement and the quantile computation procedure.
  2. [Method] Notation for the perturbation variance and the two training sets should be introduced with explicit symbols to avoid ambiguity when describing the scale construction.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for their constructive comments. The concerns regarding the lack of theoretical justification for coverage preservation and exchangeability are valid points that we address below. We indicate revisions where appropriate.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim that the perturbation-based method maintains the target simultaneous coverage while producing narrower bands lacks any derivation, proof sketch, or experimental controls showing that the perturbation step preserves the conformal guarantee. The nonconformity scores (absolute differences between the two FNO predictions) are constructed from operators trained on nearly identical datasets, which risks violating the exchangeability assumption required for split conformal prediction marginal coverage.

    Authors: We agree that the manuscript provides no formal derivation or proof sketch establishing that the perturbation step preserves the conformal guarantee. The approach applies the standard split conformal prediction procedure with the nonconformity score defined as the absolute difference between the two FNO predictions. The small Gaussian perturbation is intended to yield a sensitivity-based scale while keeping the two training distributions close. We acknowledge the risk of a mild exchangeability violation due to correlated training sets. In revision we will add a paragraph to the abstract and a short discussion subsection in Methods that supplies a heuristic argument for approximate exchangeability when perturbation variance is small, together with new experiments that vary the perturbation magnitude and report empirical coverage. revision: partial

  2. Referee: [Method] Method section (description of local scale construction): the local uncertainty scale is defined as the absolute difference between predictions of two FNOs trained on original and perturbed labels; no analysis is provided demonstrating that this scale is independent of the perturbation magnitude or that the resulting scores on calibration points remain exchangeable with test points, which is load-bearing for the coverage claim.

    Authors: The manuscript indeed contains no explicit analysis of dependence on perturbation magnitude or of exchangeability between calibration and test scores. We will revise the Method section to add a sensitivity study that varies the Gaussian perturbation variance, reports resulting coverage rates and band widths, and includes a brief theoretical remark on the conditions under which approximate exchangeability holds for small perturbations. These additions will be supported by additional numerical results on the 2D Navier-Stokes benchmark. revision: yes

standing simulated objections not resolved
  • A complete, non-approximate theoretical proof that the perturbation exactly maintains the finite-sample coverage guarantee of split conformal prediction without relaxing the exchangeability assumption.

Circularity Check

0 steps flagged

No significant circularity; derivation relies on standard conformal prediction applied to an independent perturbation scale.

full rationale

The paper constructs the local uncertainty scale via an external perturbation operation (training a second FNO on Gaussian-perturbed labels and taking the absolute difference) and then applies split conformal prediction in the usual way. This scale is not defined in terms of the calibration residuals or the target coverage quantile, nor is any prediction or coverage claim reduced by construction to a fitted parameter from the same data. No self-citation chains, uniqueness theorems, or ansatzes are invoked to justify the core procedure. The method is therefore self-contained against the external benchmark of exchangeable nonconformity scores under split conformal prediction; any questions about whether the perturbation preserves exchangeability are matters of correctness, not circularity.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

Based solely on the abstract, the central claim rests on the domain assumption that small Gaussian label perturbation produces a useful uncertainty proxy; no free parameters or invented entities are explicitly named, though the perturbation variance is implicitly required and unstated.

free parameters (1)
  • perturbation noise variance
    The magnitude of the Gaussian noise added to labels is a tunable parameter whose value must be chosen to balance sensitivity and coverage preservation, though its selection procedure is not described.
axioms (1)
  • domain assumption Perturbation sensitivity between two nearly identical operators supplies a valid local scale for conformal prediction bands
    This premise is invoked to construct the uncertainty scale without a separate uncertainty network or data split.

pith-pipeline@v0.9.1-grok · 5719 in / 1388 out tokens · 21312 ms · 2026-06-27T18:38:24.798235+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

45 extracted references · 4 linked inside Pith

  1. [1]

    A. N. Angelopoulos and S. Bates. Conformal prediction: A gentle introduction.Founda- tions and Trends in Machine Learning, 16(4):494–591, 2023. arXiv:2107.07511

  2. [2]

    R. F. Barber, E. J. Candès, A. Ramdas, and R. J. Tibshirani. Conformal prediction beyond exchangeability.The Annals of Statistics, 51(2):816–845, 2023. arXiv:2202.13415

  3. [3]

    C. M. Bishop. Training with noise is equivalent to Tikhonov regularization.Neural Com- putation, 7(1):108–116, 1995

  4. [4]

    Boullé and A

    N. Boullé and A. Townsend. A mathematical guide to operator learning. InHandbook of Numerical Analysis, volume 25, pages 83–125. Elsevier, 2024. arXiv:2312.14688

  5. [5]

    Bülte, P

    C. Bülte, P. Scholl, and G. Kutyniok. Probabilistic neural operators for functional uncer- taintyquantification.Transactions on Machine Learning Research, 2025. arXiv:2502.12902

  6. [6]

    Daxberger, A

    E. Daxberger, A. Kristiadi, A. Immer, R. Eschenhagen, M. Bauer, and P. Hennig. Laplace Redux—effortless Bayesian deep learning. InAdvances in Neural Information Processing Systems, volume 34, pages 20089–20103, 2021. arXiv:2106.14806

  7. [7]

    Thelimitsofdistribution- free conditional predictive inference.Information and Inference: A Journal of the IMA, 10(2):455–482, 2021

    R.FoygelBarber, E.J.Candès, A.Ramdas, andR.J.Tibshirani. Thelimitsofdistribution- free conditional predictive inference.Information and Inference: A Journal of the IMA, 10(2):455–482, 2021. 12

  8. [8]

    Gal and Z

    Y. Gal and Z. Ghahramani. Dropout as a Bayesian approximation: Representing model uncertainty in deep learning. InProceedings of the 33rd International Conference on Ma- chine Learning, volume 48 ofProceedings of Machine Learning Research, pages 1050–1059. PMLR, 2016

  9. [9]

    Gopakumar, A

    V. Gopakumar, A. Gray, L. Zanisi, T. Nunn, D. Giles, M. Kusner, S. Pamela, and M. P. Deisenroth. Calibrated physics-informed uncertainty quantification. InProceedings of the 42nd International Conference on Machine Learning, volume 267 ofProceedings of Machine Learning Research, pages 20103–20141. PMLR, 2025. arXiv:2502.04406

  10. [10]

    Guerra, N

    N. Guerra, N. H. Nelsen, and Y. Yang. Learning where to learn: Training data distribution optimization for scientific machine learning.arXiv preprint arXiv:2505.21626, 2025

  11. [11]

    Oncalibrationofmodernneuralnetworks

    C.Guo, G.Pleiss, Y.Sun, andK.Q.Weinberger. Oncalibrationofmodernneuralnetworks. InProceedings of the 34th International Conference on Machine Learning, volume 70 of Proceedings of Machine Learning Research, pages 1321–1330. PMLR, 2017

  12. [12]

    Hendrycks and K

    D. Hendrycks and K. Gimpel. Gaussian error linear units (GELUs).arXiv preprint arXiv:1606.08415, 2016

  13. [13]

    D. P. Kingma and J. Ba. Adam: A method for stochastic optimization. InInternational Conference on Learning Representations, 2015. arXiv:1412.6980

  14. [14]

    N. B. Kovachki, Z. Li, B. Liu, K. Azizzadenesheli, K. Bhattacharya, A. M. Stuart, and A. Anandkumar. Neural operator: Learning maps between function spaces.Journal of Machine Learning Research, 24(89):1–97, 2023. arXiv:2108.08481

  15. [15]

    N. B. Kovachki, S. Lanthaler, and A. M. Stuart. Operator learning: Algorithms and analysis. InHandbook of Numerical Analysis, volume 25, pages 419–467. Elsevier, 2024

  16. [16]

    Lakshminarayanan, A

    B. Lakshminarayanan, A. Pritzel, and C. Blundell. Simple and scalable predictive uncer- tainty estimation using deep ensembles. InAdvances in Neural Information Processing Systems, volume 30, pages 6402–6413, 2017

  17. [17]

    Lei and L

    J. Lei and L. Wasserman. Distribution-free prediction bands for non-parametric regression. Journal of the Royal Statistical Society: Series B, 76(1):71–96, 2014

  18. [18]

    J. Lei, M. G’Sell, A. Rinaldo, R. J. Tibshirani, and L. Wasserman. Distribution- free predictive inference for regression.Journal of the American Statistical Association, 113(523):1094–1111, 2018

  19. [19]

    R. J. LeVeque.Finite Volume Methods for Hyperbolic Problems. Cambridge Texts in Applied Mathematics. Cambridge University Press, Cambridge, 2002

  20. [20]

    Z. Li, N. B. Kovachki, K. Azizzadenesheli, B. Liu, K. Bhattacharya, A. M. Stuart, and A. Anandkumar. Fourier neural operator for parametric partial differential equations. In International Conference on Learning Representations, 2021

  21. [21]

    Q. Li, M. Oprea, L. Wang, and Y. Yang. Stochastic inverse problem: Stability, regulariza- tion and Wasserstein gradient flow.arXiv preprint arXiv:2410.00229, 2024

  22. [22]

    L. Lu, P. Jin, G. Pang, Z. Zhang, and G. E. Karniadakis. Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators.Nature Machine Intelligence, 3(3):218–229, 2021

  23. [23]

    Z. Ma, D. Pitt, K. Azizzadenesheli, and A. Anandkumar. Calibrated uncertainty quantifi- cation for operator learning via conformal prediction.Transactions on Machine Learning Research, 2024. ISSN 2835-8856. URLhttps://openreview.net/forum?id=cGpegxy12T. 13

  24. [24]

    D. J. C. MacKay. A practical Bayesian framework for backpropagation networks.Neural Computation, 4(3):448–472, 1992

  25. [25]

    Magnani, N

    E. Magnani, N. Krämer, R. Eschenhagen, L. Rosasco, and P. Hennig. Approximate Bayesian neural operators: Uncertainty quantification for parametric PDEs.Transactions on Machine Learning Research, 2025. arXiv:2208.01565

  26. [26]

    A. J. Majda and A. L. Bertozzi.Vorticity and Incompressible Flow. Cambridge Texts in Applied Mathematics. Cambridge University Press, Cambridge, 2002

  27. [27]

    Millard, L

    D. Millard, L. Lindemann, and A. Baheri. Split conformal prediction in the function space with neural operators.arXiv preprint arXiv:2509.04623, 2025

  28. [28]

    Molinaro, Y

    R. Molinaro, Y. Yang, B. Engquist, and S. Mishra. Neural inverse operators for solving PDE inverse problems. InProceedings of the 40th International Conference on Machine Learning, volume 202 ofProceedings of Machine Learning Research, pages 25105–25139. PMLR, 2023. arXiv:2301.11167

  29. [29]

    C. Moya, A. Mollaali, Z. Zhang, L. Lu, and G. Lin. Conformalized-DeepONet: A distribution-free framework for uncertainty quantification in deep operator networks.Phys- ica D: Nonlinear Phenomena, 471:134418, 2025. arXiv:2402.15406

  30. [30]

    N. H. Nelsen and A. M. Stuart. Operator learning using random features: A tool for scientific computing.SIAM Review, 66(3):535–571, 2024

  31. [31]

    N. H. Nelsen and Y. Yang. Operator learning meets inverse problems: A probabilistic perspective. To appear inHandbook of Numerical Analysis, volume 27:Machine Learning Solutions for Inverse Problems (Part B), Elsevier, 2026. arXiv:2508.20207

  32. [32]

    Ovadia, E

    Y. Ovadia, E. Fertig, J. Ren, Z. Nado, D. Sculley, S. Nowozin, J. V. Dillon, B. Lakshmi- narayanan, and J. Snoek. Can you trust your model’s uncertainty? Evaluating predictive uncertainty under dataset shift. InAdvances in Neural Information Processing Systems, volume 32, 2019. arXiv:1906.02530

  33. [33]

    Papadopoulos, K

    H. Papadopoulos, K. Proedrou, V. Vovk, and A. Gammerman. Inductive confidence ma- chines for regression. InMachine Learning: ECML 2002, volume 2430 ofLecture Notes in Computer Science, pages 345–356. Springer, 2002

  34. [34]

    Quarteroni, R

    A. Quarteroni, R. Sacco, and F. Saleri.Numerical Mathematics. Texts in Applied Mathe- matics, volume 37. Springer, Berlin, 2000

  35. [35]

    Ritter, A

    H. Ritter, A. Botev, and D. Barber. A scalable Laplace approximation for neural networks. InInternational Conference on Learning Representations, 2018

  36. [36]

    Conformalizedquantileregression

    Y.Romano, E.Patterson, andE.J.Candès. Conformalizedquantileregression. InAdvances in Neural Information Processing Systems, volume 32, 2019. arXiv:1905.03222

  37. [37]

    Subedi and A

    U. Subedi and A. Tewari. Controlling statistical, discretization, and truncation errors in learning Fourier linear operators.Transactions on Machine Learning Research, 2025. arXiv:2408.09004

  38. [38]

    Subedi and A

    U. Subedi and A. Tewari. Operator learning: A statistical perspective.Annual Review of Statistics and Its Application, 13:123–148, 2026. arXiv:2504.03503

  39. [39]

    Temam.Navier–Stokes Equations: Theory and Numerical Analysis

    R. Temam.Navier–Stokes Equations: Theory and Numerical Analysis. AMS Chelsea Publishing, Providence, RI, 2001. Reprint of the 1984 edition. 14

  40. [40]

    R. J. Tibshirani, R. Foygel Barber, E. J. Candès, and A. Ramdas. Conformal prediction under covariate shift. InAdvances in Neural Information Processing Systems, volume 32,

  41. [41]

    L. N. Trefethen.Spectral Methods in MATLAB. SIAM, Philadelphia, 2000

  42. [42]

    van Leeuwen and Y

    T. van Leeuwen and Y. Yang. An analysis of constraint-relaxation in PDE-based inverse problems.Inverse Problems, 41(2):025009, 2025. arXiv:2403.15292

  43. [43]

    V. Vovk, A. Gammerman, and G. Shafer.Algorithmic Learning in a Random World. Springer, New York, 2005

  44. [44]

    Winovich, M

    N. Winovich, M. Daneker, L. Lu, and G. Lin. Active operator learning with predictive un- certaintyquantificationforpartialdifferentialequations.Journal of Computational Physics, 555:114791, 2026. arXiv:2503.03178

  45. [45]

    Y. Yu, C. H. Ho, and Y. Wang. A conformal prediction framework for uncertainty quan- tification in physics-informed neural networks.arXiv preprint arXiv:2509.13717, 2025. 15