arXiv preprint arXiv:2505.08783 , year=

Shanda Li, Tanya Marwah, Junhong Shen, Weiwei Sun, Andrej Risteski, Yiming Yang, Ameet Talwalkar · 2025 · arXiv 2505.08783

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

citation-role summary

background 1 baseline 1

citation-polarity summary

background 1 baseline 1

representative citing papers

PDEAgent-Bench: A Multi-Metric, Multi-Library Benchmark for PDE Solver Generation

cs.AI · 2026-05-10 · unverdicted · novelty 8.0

PDEAgent-Bench is the first multi-metric, multi-library benchmark for AI-generated PDE solvers, evaluating executability, numerical accuracy, and efficiency across DOLFINx, Firedrake, and deal.II.

LLM-driven design of physics-constrained constitutive models: two agents are better than one

cs.LG · 2026-05-22 · unverdicted · novelty 7.0

A Creator-Inspector multi-agent LLM pipeline for constitutive artificial neural networks increases the rate of models satisfying all nine physical constraints to 100% or 56% depending on the LLM backbone.

SciML Agents: Write the Solver, Not the Solution

cs.LG · 2025-09-12 · unverdicted · novelty 7.0

LLMs prompted with domain knowledge can generate runnable, numerically valid code for stiff and non-stiff ODEs on new diagnostic and 1000-task benchmarks.

AutoPDE: Reliable Agentic PDE Solving via Explicitly Represented Solver Strategies

cs.AI · 2026-06-09 · unverdicted · novelty 6.0

AutoPDE maintains an explicit solver strategy through PDE analysis, numerical method selection, and adaptive tuning, achieving 54.5% pass rate on PDE Agent Bench, 14.2 points above the strongest baseline.

Reasoning4Sciences: Bridging Reasoning Language Models to All Scientific Branches

cs.AI · 2026-05-31 · unverdicted · novelty 6.0 · 2 refs

A survey of RLM use in 28 disciplines reveals uneven adoption and introduces a maturity assessment framework showing larger gaps when limited to public resources.

citing papers explorer

Showing 5 of 5 citing papers.

PDEAgent-Bench: A Multi-Metric, Multi-Library Benchmark for PDE Solver Generation cs.AI · 2026-05-10 · unverdicted · none · ref 11
PDEAgent-Bench is the first multi-metric, multi-library benchmark for AI-generated PDE solvers, evaluating executability, numerical accuracy, and efficiency across DOLFINx, Firedrake, and deal.II.
LLM-driven design of physics-constrained constitutive models: two agents are better than one cs.LG · 2026-05-22 · unverdicted · none · ref 43
A Creator-Inspector multi-agent LLM pipeline for constitutive artificial neural networks increases the rate of models satisfying all nine physical constraints to 100% or 56% depending on the LLM backbone.
SciML Agents: Write the Solver, Not the Solution cs.LG · 2025-09-12 · unverdicted · none · ref 33
LLMs prompted with domain knowledge can generate runnable, numerically valid code for stiff and non-stiff ODEs on new diagnostic and 1000-task benchmarks.
AutoPDE: Reliable Agentic PDE Solving via Explicitly Represented Solver Strategies cs.AI · 2026-06-09 · unverdicted · none · ref 53
AutoPDE maintains an explicit solver strategy through PDE analysis, numerical method selection, and adaptive tuning, achieving 54.5% pass rate on PDE Agent Bench, 14.2 points above the strongest baseline.
Reasoning4Sciences: Bridging Reasoning Language Models to All Scientific Branches cs.AI · 2026-05-31 · unverdicted · none · ref 159 · 2 links
A survey of RLM use in 28 disciplines reveals uneven adoption and introduces a maturity assessment framework showing larger gaps when limited to public resources.

arXiv preprint arXiv:2505.08783 , year=

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer