PDEAgent-Bench is the first multi-metric, multi-library benchmark for AI-generated PDE solvers, evaluating executability, numerical accuracy, and efficiency across DOLFINx, Firedrake, and deal.II.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3verdicts
UNVERDICTED 3roles
baseline 1polarities
baseline 1representative citing papers
The Paired Swap Permutation Test is an exact non-parametric procedure that compares explanatory power of two dependent predictors via symmetric within-subject swapping for categorical data and ECDF mapping for continuous data.
Mosaic is a benchmark suite evaluating 14 differentiable PDE solvers across fluids, structures, and heat transfer, showing large variations in cost and conditioning but similar convergence behavior.
citing papers explorer
-
PDEAgent-Bench: A Multi-Metric, Multi-Library Benchmark for PDE Solver Generation
PDEAgent-Bench is the first multi-metric, multi-library benchmark for AI-generated PDE solvers, evaluating executability, numerical accuracy, and efficiency across DOLFINx, Firedrake, and deal.II.
-
Exact Comparison of Explanatory Strength of Two Dependent Predictors
The Paired Swap Permutation Test is an exact non-parametric procedure that compares explanatory power of two dependent predictors via symmetric within-subject swapping for categorical data and ECDF mapping for continuous data.
-
Mosaic: A Benchmark Suite for Differentiable Physics Solvers
Mosaic is a benchmark suite evaluating 14 differentiable PDE solvers across fluids, structures, and heat transfer, showing large variations in cost and conditioning but similar convergence behavior.