AXLE: A Cloud Infrastructure for Lean 4 Theorem Proving Utilities
Pith reviewed 2026-06-26 00:24 UTC · model grok-4.3
The pith
AXLE delivers 14 Lean 4 metaprogramming tools through a multi-tenant cloud service with per-request isolation and multi-version support.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
AXLE is a cloud service that exposes 14 Lean 4 metaprogramming tools for proof verification, metadata extraction, source manipulation, deterministic repair and simplification, and lemma extraction; it operates as a multi-tenant deployment with per-request isolation, concurrent support for multiple Lean 4 and Mathlib versions, and access through Python SDK, CLI, web UI, MCP server, and HTTP API, having served over 500 million requests while powering Axiom Math's proving results.
What carries the argument
The multi-tenant cloud deployment with per-request isolation that runs the 14 metaprogramming tools while supporting multiple Lean 4 and Mathlib versions simultaneously.
If this is right
- AI reinforcement learning pipelines and agentic proving workflows can issue millions of Lean 4 operations without managing local installations or version conflicts.
- Dataset curation for mathematics can extract lemmas and manipulate source code at the throughput required by modern training runs.
- Proof repair and simplification become deterministic operations available on demand to any user or script.
- Multiple research groups can share the same infrastructure while each request remains isolated from the others.
- Public access removes the requirement for deep Lean 4 installation expertise before experimenting with formal proof.
Where Pith is reading between the lines
- Teams without dedicated DevOps resources for Lean 4 could still run large-scale verification experiments by calling the service.
- The same isolation model might later support other interactive theorem provers if similar metaprogramming layers are added.
- Widespread use could create public logs of common proof patterns that feed into future lemma suggestion systems.
- Integration with existing Python ML stacks becomes straightforward through the provided SDK.
Load-bearing premise
The multi-tenant cloud architecture maintains correctness, robustness, and version compatibility at the claimed scale without introducing interference or verification errors.
What would settle it
A case where two concurrent requests using different Lean 4 versions produce inconsistent verification outcomes on the same declaration, or where a request returns an incorrect proof status compared with an independent local Lean 4 execution.
read the original abstract
We present AXLE (Axiom Lean Engine), a cloud service for Lean 4 proof manipulation, extraction, and verification. Recent progress in AI for mathematics -- reinforcement learning pipelines, agentic proving workflows, dataset curation -- demands Lean 4 tooling that scales to millions of requests while remaining correct and robust; existing infrastructure offers parallel compilation but not scalable proof verification, higher-level proof manipulation, multi-version support, or per-request isolation at the throughput modern AI workflows require. AXLE provides 14 Lean 4 metaprogramming tools spanning strict proof verification, declaration metadata extraction, semantic source manipulation, deterministic proof repair and simplification, and lemma extraction. The service runs as a multi-tenant cloud deployment with per-request isolation and concurrent support for multiple Lean 4 and Mathlib versions, accessible via a Python SDK, command-line interface, web UI, MCP server, and raw HTTP API. AXLE is publicly available and free to use at https://axle.axiommath.ai and via the axiom-axle PyPI package, with no local Lean 4 installation required. It has served over 500 million requests to date and is the underlying infrastructure for Axiom Math's proving efforts, including its 12/12 score on the 2025 Putnam competition.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents AXLE (Axiom Lean Engine), a publicly available cloud service for Lean 4 proof manipulation, extraction, and verification. It implements 14 metaprogramming tools covering strict proof verification, declaration metadata extraction, semantic source manipulation, deterministic proof repair and simplification, and lemma extraction. The service is deployed as a multi-tenant cloud infrastructure with per-request isolation and concurrent support for multiple Lean 4 and Mathlib versions. Access is provided via Python SDK, CLI, web UI, MCP server, and raw HTTP API with no local Lean 4 installation required. The paper states that AXLE has served over 500 million requests and serves as the underlying infrastructure for Axiom Math's proving efforts, including a 12/12 score on the 2025 Putnam competition.
Significance. If the described capabilities and deployment hold, AXLE provides a practical, scalable solution to a recognized bottleneck in AI-for-mathematics research: the need for high-throughput, correct, and isolated Lean 4 interactions without local setup. The public endpoint, multi-version support, and per-request isolation directly address limitations of existing parallel-compilation tools. Credit is due for the explicit public release, the SDK, and the demonstrated downstream use in a high-visibility proving task. These elements position the work as a useful systems contribution that can be immediately adopted by reinforcement-learning pipelines, agentic provers, and dataset curators.
minor comments (2)
- [Abstract] Abstract: the assertion of 'feature completeness' and 'strict proof verification' is presented at a high level without enumerating the 14 tools or distinguishing the verification mode from standard Lean kernel checks; a concise table or enumerated list in the main text would improve clarity.
- [Abstract] The manuscript supplies no high-level performance indicators (e.g., average latency, error rate under concurrent load, or version-compatibility test coverage) even though the central narrative emphasizes scale and robustness; while external verification is possible via the public endpoint, such indicators would make the paper more self-contained.
Simulated Author's Rebuttal
We thank the referee for their thorough review, positive assessment of AXLE's contributions, and recommendation to accept the manuscript. We are pleased that the work is recognized as addressing a practical bottleneck in AI-for-mathematics tooling through its public release, multi-version support, per-request isolation, and demonstrated high-volume usage.
Circularity Check
No significant circularity
full rationale
The paper is a systems description of a deployed cloud service providing Lean 4 metaprogramming tools. It contains no derivations, equations, predictions, fitted parameters, or load-bearing self-citations. All central claims (existence of 14 tools, multi-tenant architecture, request volume, Putnam usage) are presented as directly observable facts about a public service rather than results computed from internal inputs that could reduce to those inputs by construction.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Aniva, L., Sun, C., Miranda, B., Barrett, C., and Koyejo, S. Pantograph : A machine-to-machine interaction interface for advanced theorem proving, high-level reasoning, and data extraction in Lean 4. In Gurfinkel, A. and Heule, M. (eds.), Tools and Algorithms for the Construction and Analysis of Systems (TACAS), pp.\ 104--123. Springer Nature Switzerland,...
-
[2]
Goedel-workbook-lean-4.27: Lean workbook competition-math proofs from DeepSeek-Prover-V1.5 , migrated to Lean 4.27.0 / Mathlib v4.27.0
banach1729 . Goedel-workbook-lean-4.27: Lean workbook competition-math proofs from DeepSeek-Prover-V1.5 , migrated to Lean 4.27.0 / Mathlib v4.27.0. https://huggingface.co/datasets/banach1729/goedel-workbook-lean427, 2026
2026
-
[3]
Almost all primes are partially regular
Chen, E., Cummins, C., Eltschig, B., et al. Almost all primes are partially regular. arXiv preprint arXiv:2602.05090, 2026 a
arXiv 2026
-
[4]
Fel's conjecture on syzygies of numerical semigroups
Chen, E., Cummins, C., Grubisic, D., et al. Fel's conjecture on syzygies of numerical semigroups. arXiv preprint arXiv:2602.03716, 2026 b
arXiv 2026
-
[5]
Chen, L. et al. Seed-Prover : Deep and broad reasoning for automated theorem proving. arXiv preprint arXiv:2507.23726, 2025
arXiv 2025
-
[6]
doi: 10.1007/978-3-030-79876-5_37
de Moura, L. and Ullrich, S. The Lean 4 theorem prover and programming language. In Automated Deduction (CADE 28), volume 12699 of Lecture Notes in Computer Science, pp.\ 625--635. Springer, 2021. doi:10.1007/978-3-030-79876-5_37
-
[7]
Kimina Lean Server : A high-performance Lean server for large-scale verification
Dos Santos, M., de Sax \'e , H., Wang, H., Wang, R., Bak s ys, M., \"U nsal, M., Liu, J., Liu, Z., and Li, J. Kimina Lean Server : A high-performance Lean server for large-scale verification. In NeurIPS 2025 Workshop on AI for Math, 2025. arXiv preprint arXiv:2504.21230
arXiv 2025
-
[8]
leanclient: A Python client for interacting with the Lean 4 language server
Dressler, O. leanclient: A Python client for interacting with the Lean 4 language server. https://github.com/oOo0oOo/leanclient, 2025
2025
-
[9]
jixia: A static analysis tool for Lean 4
FrenzyMath . jixia: A static analysis tool for Lean 4. https://github.com/frenzymath/jixia, 2024
2024
-
[10]
Safeverify: A Lean 4 script for robustly verifying submitted proofs of theorems and implementations of functions
GasStationManager . Safeverify: A Lean 4 script for robustly verifying submitted proofs of theorems and implementations of functions. https://github.com/GasStationManager/SafeVerify, 2025
2025
-
[11]
Hubert, T. et al. Olympiad-level formal mathematical reasoning with reinforcement learning. Nature, 2025. doi:10.1038/s41586-025-09833-y
-
[12]
A read--eval--print loop for Lean 4
Lean community . A read--eval--print loop for Lean 4. https://github.com/leanprover-community/repl, 2023
2023
-
[13]
lean4checker: Replay the Environment for a given Lean module
Lean FRO . lean4checker: Replay the Environment for a given Lean module. https://github.com/leanprover/lean4checker, 2023. Integrated into Lean 4 v4.28.0 as leanchecker ; repository archived 2026
2023
-
[14]
Comparator: A trustworthy judge for Lean proofs
Lean FRO . Comparator: A trustworthy judge for Lean proofs. https://github.com/leanprover/comparator, 2025
2025
-
[15]
Numina-Lean-Agent : An open and general agentic reasoning system for formal mathematics
Liu, J., Zhou, Z., Zhu, Z., Dos Santos, M., He, W., Liu, J., Wang, R., Xie, Y., Zhao, J., Wang, Q., Zhi, L., Li, J., and Li, W. Numina-Lean-Agent : An open and general agentic reasoning system for formal mathematics. arXiv preprint arXiv:2601.14027, 2026
arXiv 2026
-
[16]
Gauss : An agent for autoformalization
Math Inc. Gauss : An agent for autoformalization. https://www.math.inc/gauss, 2025
2025
-
[17]
Leanstral : An open-source code agent for Lean 4
Mistral AI . Leanstral : An open-source code agent for Lean 4. https://mistral.ai/news/leanstral, 2026
2026
-
[18]
APOLLO : Automated LLM and Lean collaboration for advanced formal reasoning
Ospanov, A., Farnia, F., and Yousefzadeh, R. APOLLO : Automated LLM and Lean collaboration for advanced formal reasoning. arXiv preprint arXiv:2505.05758, 2025
arXiv 2025
-
[19]
LeanInteract : A Python interface for Lean 4
Poiroux, A., Kun c ak, V., and Bosselut, A. LeanInteract : A Python interface for Lean 4. https://github.com/augustepoiroux/LeanInteract, 2025
2025
-
[20]
Z., Shao, Z., Song, J., Xin, H., Wang, H., Zhao, W., Zhang, L., Fu, Z., Zhu, Q., Yang, D., et al
Ren, Z. Z., Shao, Z., Song, J., Xin, H., Wang, H., Zhao, W., Zhang, L., Fu, Z., Zhu, Q., Yang, D., et al. DeepSeek-Prover-V2 : Advancing formal mathematical reasoning via reinforcement learning for subgoal decomposition. arXiv preprint arXiv:2504.21801, 2025
Pith/arXiv arXiv 2025
-
[21]
An in-context learning agent for formal theorem-proving
Thakur, A., Tsoukalas, G., Wen, Y., Xin, J., and Chaudhuri, S. An in-context learning agent for formal theorem-proving. In Conference on Language Modeling (COLM), 2024. arXiv preprint arXiv:2310.04353
arXiv 2024
-
[22]
ProofWala : Multilingual proof data synthesis and theorem-proving
Thakur, A., Tsoukalas, G., Durrett, G., and Chaudhuri, S. ProofWala : Multilingual proof data synthesis and theorem-proving. arXiv preprint arXiv:2502.04671, 2025
Pith/arXiv arXiv 2025
-
[23]
Aristotle : IMO -level automated theorem proving
The Harmonic Team . Aristotle : IMO -level automated theorem proving. arXiv preprint arXiv:2510.01346, 2025
Pith/arXiv arXiv 2025
-
[24]
The mathlib Community . The Lean mathematical library. In Proceedings of the 9th ACM SIGPLAN International Conference on Certified Programs and Proofs (CPP), pp.\ 367--381, 2020. doi:10.1145/3372885.3373824
-
[25]
Hilbert : Recursively building formal proofs with informal reasoning
Varambally, S., Voice, T., Sun, Y., Chen, Z., Yu, R., and Ye, K. Hilbert : Recursively building formal proofs with informal reasoning. arXiv preprint arXiv:2509.22819, 2025
arXiv 2025
-
[26]
Wang, H. et al. Kimina-Prover Preview : Towards large formal reasoning models with reinforcement learning. arXiv preprint arXiv:2504.11354, 2025
Pith/arXiv arXiv 2025
-
[27]
LeanDojo : Theorem proving with retrieval-augmented language models
Yang, K., Swope, A., Gu, A., Chalamala, R., Song, P., Yu, S., Godil, S., Prenger, R., and Anandkumar, A. LeanDojo : Theorem proving with retrieval-augmented language models. In Advances in Neural Information Processing Systems (NeurIPS), 2023
2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.