Lectures on AI for Mathematics

Xiaoyang Chen

arxiv: 2604.11504 · v1 · submitted 2026-04-13 · 💻 cs.AI · math.AP· math.AT· math.DG

Lectures on AI for Mathematics

Xiaoyang Chen This is my paper

Pith reviewed 2026-05-10 16:19 UTC · model grok-4.3

classification 💻 cs.AI math.APmath.ATmath.DG

keywords AI for mathematicsmathematical pattern discoveryautomated theorem provingcounterexample generationAI in researchmathematical conjectures

0 comments

The pith

AI can discover hidden mathematical patterns, assist in proving theorems, and construct counterexamples to challenge conjectures.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This book introduces the emerging field of applying artificial intelligence to mathematical research. It explains core principles for using AI to find patterns that may not be obvious to human researchers. The text describes applications where AI helps prove difficult theorems through automated methods. It also covers how AI can generate counterexamples to test or refute mathematical conjectures. A reader would care because this positions AI as a potential tool to tackle problems that have remained unsolved for a long time.

Core claim

The book establishes that artificial intelligence offers practical methods to advance mathematics by discovering hidden patterns in structures, supporting the proof of complicated theorems, and creating counterexamples that challenge existing conjectures.

What carries the argument

AI techniques for pattern recognition, automated reasoning, and counterexample generation applied directly to mathematical objects and problems.

If this is right

Mathematicians could use AI to analyze large sets of data for new structures and relations.
Conjectures that resist traditional proof methods might be settled by AI-generated counterexamples.
The process of theorem proving could become faster through AI assistance in step verification and suggestion.
Research in fields like number theory or algebra might accelerate by combining human insight with AI pattern detection.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This framework might eventually allow AI to propose entirely new conjectures based on observed patterns across different areas of math.
Integration of such AI tools could change mathematics education by letting students interact with automated discovery systems.
Similar methods could extend to related domains like theoretical physics where mathematical patterns underpin physical laws.

Load-bearing premise

That current or near-future AI systems can actually perform these mathematical tasks in ways that meaningfully advance research rather than only describing limited or speculative uses.

What would settle it

An experiment that trains an AI on large collections of mathematical data and then checks whether it can independently prove a known theorem or produce a valid counterexample to an open conjecture would test the central claim.

Figures

Figures reproduced from arXiv: 2604.11504 by Xiaoyang Chen.

**Figure 2.** Figure 2 [PITH_FULL_IMAGE:figures/full_fig_p018_2.png] view at source ↗

**Figure 2.** Figure 2 [PITH_FULL_IMAGE:figures/full_fig_p021_2.png] view at source ↗

**Figure 2.** Figure 2 [PITH_FULL_IMAGE:figures/full_fig_p027_2.png] view at source ↗

**Figure 2.** Figure 2 [PITH_FULL_IMAGE:figures/full_fig_p029_2.png] view at source ↗

**Figure 5.** Figure 5: in Qiu Xipeng’s “Neural Networks and Deep Learning”). [PITH_FULL_IMAGE:figures/full_fig_p030_5.png] view at source ↗

**Figure 2.** Figure 2 [PITH_FULL_IMAGE:figures/full_fig_p031_2.png] view at source ↗

**Figure 2.** Figure 2 [PITH_FULL_IMAGE:figures/full_fig_p032_2.png] view at source ↗

**Figure 2.** Figure 2 [PITH_FULL_IMAGE:figures/full_fig_p037_2.png] view at source ↗

**Figure 2.** Figure 2 [PITH_FULL_IMAGE:figures/full_fig_p041_2.png] view at source ↗

**Figure 2.** Figure 2 [PITH_FULL_IMAGE:figures/full_fig_p046_2.png] view at source ↗

**Figure 2.** Figure 2 [PITH_FULL_IMAGE:figures/full_fig_p047_2.png] view at source ↗

**Figure 3.** Figure 3 [PITH_FULL_IMAGE:figures/full_fig_p082_3.png] view at source ↗

**Figure 5.** Figure 5 [PITH_FULL_IMAGE:figures/full_fig_p110_5.png] view at source ↗

**Figure 6.** Figure 6 [PITH_FULL_IMAGE:figures/full_fig_p128_6.png] view at source ↗

**Figure 6.** Figure 6 [PITH_FULL_IMAGE:figures/full_fig_p144_6.png] view at source ↗

read the original abstract

This book provides a comprehensive and accessible introduction to the emerging field of AI for mathematics. It covers the core principles and diverse applications of using artificial intelligence to advance mathematical research. Through clear explanations, the text explores how AI can discover hidden mathematical patterns, assist in proving complicated theorems, and even construct counterexamples to challenge conjectures.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 0 minor

Summary. This manuscript is a book-length set of lectures providing a comprehensive and accessible introduction to the emerging field of AI for mathematics. It covers core principles and diverse applications, including how AI can discover hidden mathematical patterns, assist in proving complicated theorems, and construct counterexamples to challenge conjectures, supported by references to prior work rather than new demonstrations.

Significance. As a survey-style overview, the book could serve as a useful educational resource for students and researchers entering the intersection of AI and mathematics if the explanations are clear and the cited applications are current. However, it advances no original theorems, empirical results, or proofs, so its significance is limited to synthesis and accessibility rather than advancing the state of the art. No machine-checked proofs, reproducible code, or falsifiable predictions are present, consistent with its introductory scope.

major comments (1)

Abstract and overall framing: the central statements describe AI capabilities for pattern discovery, theorem assistance, and counterexample construction as established or near-term outcomes, yet the text is explicitly an overview without new quantitative claims, derivations, or demonstrations; this makes the weakest assumption (that current AI systems can meaningfully advance research in these ways) load-bearing but untested within the manuscript itself.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their review and recommendation of minor revision. We agree that the manuscript is an introductory survey and have revised the abstract and framing to more precisely attribute the discussed AI capabilities to existing literature.

read point-by-point responses

Referee: Abstract and overall framing: the central statements describe AI capabilities for pattern discovery, theorem assistance, and counterexample construction as established or near-term outcomes, yet the text is explicitly an overview without new quantitative claims, derivations, or demonstrations; this makes the weakest assumption (that current AI systems can meaningfully advance research in these ways) load-bearing but untested within the manuscript itself.

Authors: We appreciate the referee highlighting this potential source of ambiguity. The manuscript is explicitly positioned as a survey of the field and contains no new empirical results or proofs, as correctly noted in the report. The abstract summarizes the topics addressed in the lectures, which are drawn from documented applications in the cited prior work. To address the concern, we have revised the abstract to explicitly frame these as capabilities explored through existing research: 'This book provides a comprehensive and accessible introduction to the emerging field of AI for mathematics. It covers the core principles and diverse applications of using artificial intelligence to advance mathematical research, as demonstrated in the literature, including how AI can discover hidden mathematical patterns, assist in proving complicated theorems, and construct counterexamples to challenge conjectures.' This change ensures the claims are presented as a synthesis of established results rather than untested assumptions internal to the manuscript. We believe the revision resolves the framing issue without altering the introductory scope. revision: yes

Circularity Check

0 steps flagged

No significant circularity; survey text with no derivations or predictions

full rationale

The document is a survey-style lecture book introducing existing and emerging applications of AI to mathematics. It does not advance an original central claim, theorem, or empirical result that requires internal verification. Descriptions of pattern discovery, theorem assistance, and counterexample construction are presented as overviews of the field, supported by references to prior work rather than new demonstrations or proofs within the text. No equations, fitted parameters, predictions, self-definitions, or self-citation chains appear that could reduce to inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No free parameters, axioms, or invented entities are introduced because the work is an overview text rather than a technical derivation.

pith-pipeline@v0.9.0 · 5328 in / 926 out tokens · 34351 ms · 2026-05-10T16:19:35.440535+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

32 extracted references · 32 canonical work pages

[1]

Solving inverse problems using data-driven models

Simon Arridge et al. “Solving inverse problems using data-driven models”. In: Acta Numerica 28 (2019), pp. 1–174

work page 2019
[2]

Model reduction and approximation

Peter Benner et al. Model reduction and approximation. SIAM, 2017

work page 2017
[3]

Promising directions of machine learning for partial differential equations

Steven L Brunton and J Nathan Kutz. “Promising directions of machine learning for partial differential equations”. In: Nature Computational Science 4.7 (2024), pp. 483–494

work page 2024
[4]

Spectral methods: Fundamentals in single domains

Claudio Canuto et al. Spectral methods: Fundamentals in single domains . Springer, 2006

work page 2006
[5]

Separable physics-informed neural networks

Junwoo Cho et al. “Separable physics-informed neural networks”. In: Advances in Neural Information Processing Systems. Vol. 36. 2023

work page 2023
[6]

Finite element methods for flow problems

Jean Donea and Antonio Huerta. Finite element methods for flow problems. John Wiley & Sons, 2003

work page 2003
[7]

Partial differential equations

Lawrence C Evans. Partial differential equations. American Mathematical Society, 2010

work page 2010
[8]

Computational methods for fluid dynamics

Joel H Ferziger and Milovan Perić. Computational methods for fluid dynamics. Springer, 2012

work page 2012
[9]

A first course in finite elements

Jacob Fish and Ted Belytschko. A first course in finite elements. John Wiley & Sons, 2007

work page 2007
[10]

Sparse grids and applications

Jochen Garcke and Michael Griebel. Sparse grids and applications . Springer, 2012

work page 2012
[11]

DaPINN: Dimension-augmented physics-informed neural networks

Jian Guan et al. “DaPINN: Dimension-augmented physics-informed neural networks”. In: arXiv preprint (2023). 155 6.3 Applications

work page 2023
[12]

A physics-informed deep learning framework for inversion and surrogate mod- eling in solid mechanics

Ehsan Haghighat et al. “A physics-informed deep learning framework for inversion and surrogate mod- eling in solid mechanics”. In: Computer Methods in Applied Mechanics and Engineering 379 (2021), p. 113741

work page 2021
[13]

Extended physics-informed neural networks (XPINNs): A generalized space-time domain decomposition based deep learning framework for nonlinear partial differential equations

Ameya D Jagtap and George Em Karniadakis. “Extended physics-informed neural networks (XPINNs): A generalized space-time domain decomposition based deep learning framework for nonlinear partial differential equations”. In: Communications in Computational Physics 28.5 (2020), pp. 2002–2041

work page 2020
[14]

Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems

Ameya D Jagtap, Ehsan Kharazmi, and George Em Karniadakis. “Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems”. In: Computer Methods in Applied Mechanics and Engineering 365 (2020), p. 113028

work page 2020
[15]

Physics-informed machine learning

George Em Karniadakis et al. “Physics-informed machine learning”. In: Nature Reviews Physics 3.6 (2021), pp. 422–440

work page 2021
[16]

Characterizing possible failure modes in physics-informed neural networks

Aditi Krishnapriyan et al. “Characterizing possible failure modes in physics-informed neural networks”. In: Advances in Neural Information Processing Systems. Vol. 34. 2021, pp. 26548–26560

work page 2021
[17]

SIAM, 2007

Randall J LeVeque.Finite difference methods for ordinary and partial differential equations. SIAM, 2007

work page 2007
[18]

Finite volume methods for hyperbolic problems

Randall J LeVeque. Finite volume methods for hyperbolic problems. Cambridge University Press, 2002

work page 2002
[19]

Physics-informed neural networks for PDE problems: A comprehensive review

S Liao et al. “Physics-informed neural networks for PDE problems: A comprehensive review”. In: arXiv preprint (2025)

work page 2025
[20]

Applied partial differential equations

J David Logan. Applied partial differential equations. Springer, 2015

work page 2015
[21]

DeepXDE: A deep learning library for solving differential equations

Lu Lu et al. “DeepXDE: A deep learning library for solving differential equations”. In:SIAM Review 63.1 (2021), pp. 208–228

work page 2021
[22]

Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators

Lu Lu et al. “Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators”. In: Nature Machine Intelligence 3.3 (2021), pp. 218–229

work page 2021
[23]

Numerical solution of partial differential equations

K W Morton and D F Mayers. Numerical solution of partial differential equations. Cambridge University Press, 2005

work page 2005
[24]

On the spectral bias of neural networks

Nasim Rahaman et al. “On the spectral bias of neural networks”. In:International Conference on Machine Learning. PMLR. 2019, pp. 5301–5310

work page 2019
[25]

Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equa- tions

Maziar Raissi, Paris Perdikaris, and George Em Karniadakis. “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equa- tions”. In: Journal of Computational Physics 378 (2019), pp. 686–707

work page 2019
[26]

DGM: A deep learning algorithm for solving partial differential equations

Justin Sirignano and Konstantinos Spiliopoulos. “DGM: A deep learning algorithm for solving partial differential equations”. In: Journal of Computational Physics 375 (2018), pp. 1339–1364

work page 2018
[27]

Fourier features let networks learn high frequency functions in low dimensional domains

Matthew Tancik et al. “Fourier features let networks learn high frequency functions in low dimensional domains”. In: Advances in Neural Information Processing Systems. Vol. 33. 2020, pp. 7537–7547

work page 2020
[28]

Spectral methods in MATLAB

Lloyd N Trefethen. Spectral methods in MATLAB . SIAM, 2000

work page 2000
[29]

Physics-informed neural networks for fluid-structure interaction

Rui Wang et al. “Physics-informed neural networks for fluid-structure interaction”. In: arXiv preprint (2023)

work page 2023
[30]

On the eigenvector bias of Fourier feature networks: From regression to solving multi-scale PDEs with physics-informed neural networks

Sifan Wang, Yujun Teng, and Paris Perdikaris. “On the eigenvector bias of Fourier feature networks: From regression to solving multi-scale PDEs with physics-informed neural networks”. In: Computer Methods in Applied Mechanics and Engineering 393 (2022), p. 114823. 156 6.3 Applications

work page 2022
[31]

Understanding and mitigating gradient flow pathologies in physics-informed neural networks

Sifan Wang, Yujun Teng, and Paris Perdikaris. “Understanding and mitigating gradient flow pathologies in physics-informed neural networks”. In: SIAM Journal on Scientific Computing 43.5 (2021), A3055– A3081

work page 2021
[32]

The finite element method: Its basis and funda- mentals

Olek C Zienkiewicz, Robert L Taylor, and Jian Z Zhu. The finite element method: Its basis and funda- mentals. Elsevier, 2013. 157

work page 2013

[1] [1]

Solving inverse problems using data-driven models

Simon Arridge et al. “Solving inverse problems using data-driven models”. In: Acta Numerica 28 (2019), pp. 1–174

work page 2019

[2] [2]

Model reduction and approximation

Peter Benner et al. Model reduction and approximation. SIAM, 2017

work page 2017

[3] [3]

Promising directions of machine learning for partial differential equations

Steven L Brunton and J Nathan Kutz. “Promising directions of machine learning for partial differential equations”. In: Nature Computational Science 4.7 (2024), pp. 483–494

work page 2024

[4] [4]

Spectral methods: Fundamentals in single domains

Claudio Canuto et al. Spectral methods: Fundamentals in single domains . Springer, 2006

work page 2006

[5] [5]

Separable physics-informed neural networks

Junwoo Cho et al. “Separable physics-informed neural networks”. In: Advances in Neural Information Processing Systems. Vol. 36. 2023

work page 2023

[6] [6]

Finite element methods for flow problems

Jean Donea and Antonio Huerta. Finite element methods for flow problems. John Wiley & Sons, 2003

work page 2003

[7] [7]

Partial differential equations

Lawrence C Evans. Partial differential equations. American Mathematical Society, 2010

work page 2010

[8] [8]

Computational methods for fluid dynamics

Joel H Ferziger and Milovan Perić. Computational methods for fluid dynamics. Springer, 2012

work page 2012

[9] [9]

A first course in finite elements

Jacob Fish and Ted Belytschko. A first course in finite elements. John Wiley & Sons, 2007

work page 2007

[10] [10]

Sparse grids and applications

Jochen Garcke and Michael Griebel. Sparse grids and applications . Springer, 2012

work page 2012

[11] [11]

DaPINN: Dimension-augmented physics-informed neural networks

Jian Guan et al. “DaPINN: Dimension-augmented physics-informed neural networks”. In: arXiv preprint (2023). 155 6.3 Applications

work page 2023

[12] [12]

A physics-informed deep learning framework for inversion and surrogate mod- eling in solid mechanics

Ehsan Haghighat et al. “A physics-informed deep learning framework for inversion and surrogate mod- eling in solid mechanics”. In: Computer Methods in Applied Mechanics and Engineering 379 (2021), p. 113741

work page 2021

[13] [13]

Extended physics-informed neural networks (XPINNs): A generalized space-time domain decomposition based deep learning framework for nonlinear partial differential equations

Ameya D Jagtap and George Em Karniadakis. “Extended physics-informed neural networks (XPINNs): A generalized space-time domain decomposition based deep learning framework for nonlinear partial differential equations”. In: Communications in Computational Physics 28.5 (2020), pp. 2002–2041

work page 2020

[14] [14]

Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems

Ameya D Jagtap, Ehsan Kharazmi, and George Em Karniadakis. “Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems”. In: Computer Methods in Applied Mechanics and Engineering 365 (2020), p. 113028

work page 2020

[15] [15]

Physics-informed machine learning

George Em Karniadakis et al. “Physics-informed machine learning”. In: Nature Reviews Physics 3.6 (2021), pp. 422–440

work page 2021

[16] [16]

Characterizing possible failure modes in physics-informed neural networks

Aditi Krishnapriyan et al. “Characterizing possible failure modes in physics-informed neural networks”. In: Advances in Neural Information Processing Systems. Vol. 34. 2021, pp. 26548–26560

work page 2021

[17] [17]

SIAM, 2007

Randall J LeVeque.Finite difference methods for ordinary and partial differential equations. SIAM, 2007

work page 2007

[18] [18]

Finite volume methods for hyperbolic problems

Randall J LeVeque. Finite volume methods for hyperbolic problems. Cambridge University Press, 2002

work page 2002

[19] [19]

Physics-informed neural networks for PDE problems: A comprehensive review

S Liao et al. “Physics-informed neural networks for PDE problems: A comprehensive review”. In: arXiv preprint (2025)

work page 2025

[20] [20]

Applied partial differential equations

J David Logan. Applied partial differential equations. Springer, 2015

work page 2015

[21] [21]

DeepXDE: A deep learning library for solving differential equations

Lu Lu et al. “DeepXDE: A deep learning library for solving differential equations”. In:SIAM Review 63.1 (2021), pp. 208–228

work page 2021

[22] [22]

Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators

Lu Lu et al. “Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators”. In: Nature Machine Intelligence 3.3 (2021), pp. 218–229

work page 2021

[23] [23]

Numerical solution of partial differential equations

K W Morton and D F Mayers. Numerical solution of partial differential equations. Cambridge University Press, 2005

work page 2005

[24] [24]

On the spectral bias of neural networks

Nasim Rahaman et al. “On the spectral bias of neural networks”. In:International Conference on Machine Learning. PMLR. 2019, pp. 5301–5310

work page 2019

[25] [25]

Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equa- tions

Maziar Raissi, Paris Perdikaris, and George Em Karniadakis. “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equa- tions”. In: Journal of Computational Physics 378 (2019), pp. 686–707

work page 2019

[26] [26]

DGM: A deep learning algorithm for solving partial differential equations

Justin Sirignano and Konstantinos Spiliopoulos. “DGM: A deep learning algorithm for solving partial differential equations”. In: Journal of Computational Physics 375 (2018), pp. 1339–1364

work page 2018

[27] [27]

Fourier features let networks learn high frequency functions in low dimensional domains

Matthew Tancik et al. “Fourier features let networks learn high frequency functions in low dimensional domains”. In: Advances in Neural Information Processing Systems. Vol. 33. 2020, pp. 7537–7547

work page 2020

[28] [28]

Spectral methods in MATLAB

Lloyd N Trefethen. Spectral methods in MATLAB . SIAM, 2000

work page 2000

[29] [29]

Physics-informed neural networks for fluid-structure interaction

Rui Wang et al. “Physics-informed neural networks for fluid-structure interaction”. In: arXiv preprint (2023)

work page 2023

[30] [30]

On the eigenvector bias of Fourier feature networks: From regression to solving multi-scale PDEs with physics-informed neural networks

Sifan Wang, Yujun Teng, and Paris Perdikaris. “On the eigenvector bias of Fourier feature networks: From regression to solving multi-scale PDEs with physics-informed neural networks”. In: Computer Methods in Applied Mechanics and Engineering 393 (2022), p. 114823. 156 6.3 Applications

work page 2022

[31] [31]

Understanding and mitigating gradient flow pathologies in physics-informed neural networks

Sifan Wang, Yujun Teng, and Paris Perdikaris. “Understanding and mitigating gradient flow pathologies in physics-informed neural networks”. In: SIAM Journal on Scientific Computing 43.5 (2021), A3055– A3081

work page 2021

[32] [32]

The finite element method: Its basis and funda- mentals

Olek C Zienkiewicz, Robert L Taylor, and Jian Z Zhu. The finite element method: Its basis and funda- mentals. Elsevier, 2013. 157

work page 2013