AXLE: A Cloud Infrastructure for Lean 4 Theorem Proving Utilities

Alex Schneidman; Chris Cummins; Jannis Limperg; Jimmy Xin; Karun Ram; Srihari Ganesh

arxiv: 2606.26442 · v1 · pith:4SFWZRV7new · submitted 2026-06-24 · 💻 cs.LO · cs.AI

AXLE: A Cloud Infrastructure for Lean 4 Theorem Proving Utilities

Jimmy Xin , Alex Schneidman , Chris Cummins , Karun Ram , Srihari Ganesh , Jannis Limperg This is my paper

Pith reviewed 2026-06-26 00:24 UTC · model grok-4.3

classification 💻 cs.LO cs.AI

keywords Lean 4theorem provingcloud servicemetaprogrammingproof verificationAI for mathematicsmulti-tenant deploymentproof repair

0 comments

The pith

AXLE delivers 14 Lean 4 metaprogramming tools through a multi-tenant cloud service with per-request isolation and multi-version support.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents AXLE as a publicly available cloud service that supplies Lean 4 utilities for strict proof verification, declaration metadata extraction, semantic source manipulation, deterministic proof repair, and lemma extraction. It addresses the need for scalable tooling in AI-driven mathematics workflows by running as a multi-tenant deployment that handles concurrent requests across multiple Lean 4 and Mathlib versions without requiring local installation. The service has processed over 500 million requests and underpins high-stakes proving efforts such as a perfect score on the 2025 Putnam competition. A sympathetic reader would care because it removes infrastructure barriers that currently limit large-scale automated theorem proving and dataset work.

Core claim

AXLE is a cloud service that exposes 14 Lean 4 metaprogramming tools for proof verification, metadata extraction, source manipulation, deterministic repair and simplification, and lemma extraction; it operates as a multi-tenant deployment with per-request isolation, concurrent support for multiple Lean 4 and Mathlib versions, and access through Python SDK, CLI, web UI, MCP server, and HTTP API, having served over 500 million requests while powering Axiom Math's proving results.

What carries the argument

The multi-tenant cloud deployment with per-request isolation that runs the 14 metaprogramming tools while supporting multiple Lean 4 and Mathlib versions simultaneously.

If this is right

AI reinforcement learning pipelines and agentic proving workflows can issue millions of Lean 4 operations without managing local installations or version conflicts.
Dataset curation for mathematics can extract lemmas and manipulate source code at the throughput required by modern training runs.
Proof repair and simplification become deterministic operations available on demand to any user or script.
Multiple research groups can share the same infrastructure while each request remains isolated from the others.
Public access removes the requirement for deep Lean 4 installation expertise before experimenting with formal proof.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Teams without dedicated DevOps resources for Lean 4 could still run large-scale verification experiments by calling the service.
The same isolation model might later support other interactive theorem provers if similar metaprogramming layers are added.
Widespread use could create public logs of common proof patterns that feed into future lemma suggestion systems.
Integration with existing Python ML stacks becomes straightforward through the provided SDK.

Load-bearing premise

The multi-tenant cloud architecture maintains correctness, robustness, and version compatibility at the claimed scale without introducing interference or verification errors.

What would settle it

A case where two concurrent requests using different Lean 4 versions produce inconsistent verification outcomes on the same declaration, or where a request returns an incorrect proof status compared with an independent local Lean 4 execution.

read the original abstract

We present AXLE (Axiom Lean Engine), a cloud service for Lean 4 proof manipulation, extraction, and verification. Recent progress in AI for mathematics -- reinforcement learning pipelines, agentic proving workflows, dataset curation -- demands Lean 4 tooling that scales to millions of requests while remaining correct and robust; existing infrastructure offers parallel compilation but not scalable proof verification, higher-level proof manipulation, multi-version support, or per-request isolation at the throughput modern AI workflows require. AXLE provides 14 Lean 4 metaprogramming tools spanning strict proof verification, declaration metadata extraction, semantic source manipulation, deterministic proof repair and simplification, and lemma extraction. The service runs as a multi-tenant cloud deployment with per-request isolation and concurrent support for multiple Lean 4 and Mathlib versions, accessible via a Python SDK, command-line interface, web UI, MCP server, and raw HTTP API. AXLE is publicly available and free to use at https://axle.axiommath.ai and via the axiom-axle PyPI package, with no local Lean 4 installation required. It has served over 500 million requests to date and is the underlying infrastructure for Axiom Math's proving efforts, including its 12/12 score on the 2025 Putnam competition.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

AXLE is a live cloud service offering 14 Lean 4 tools with isolation and multi-version support that targets AI-scale proving needs, but the paper gives almost no performance data or implementation specifics.

read the letter

AXLE is a cloud service that bundles 14 Lean 4 metaprogramming tools and exposes them through a Python SDK, CLI, web UI, and HTTP API without any local Lean install. It supports per-request isolation and runs multiple Lean and Mathlib versions concurrently. The authors report it has handled over 500 million requests and underpins Axiom Math's 12/12 Putnam result.

The useful part is the integrated offering itself. Existing Lean infrastructure handles parallel compilation but not the combination of higher-level proof manipulation, deterministic repair, lemma extraction, and scalable multi-tenant access that modern reinforcement-learning and agent pipelines require. Putting this behind public endpoints and removing the local-install barrier is a concrete step that some groups will actually use.

The paper describes the tool categories clearly enough and notes the live deployment. That the service is externally reachable means the headline claims can be checked directly, which is better than pure assertion.

The main weakness is the absence of any numbers. There are no latency or throughput measurements, no error rates, no description of how isolation is enforced, and no account of how version compatibility or correctness is preserved at the claimed scale. The Putnam connection is stated but not tied to specific AXLE features or compared with other setups. Without those details the paper reads as an announcement rather than an evaluation.

This is for people running large Lean experiments or building AI math systems who need cloud access. It is not a theoretical result. The work is coherent on its own terms and the claims are testable, so it deserves a serious referee to see whether the authors can supply the missing measurements and architecture notes.

Referee Report

0 major / 2 minor

Summary. The paper presents AXLE (Axiom Lean Engine), a publicly available cloud service for Lean 4 proof manipulation, extraction, and verification. It implements 14 metaprogramming tools covering strict proof verification, declaration metadata extraction, semantic source manipulation, deterministic proof repair and simplification, and lemma extraction. The service is deployed as a multi-tenant cloud infrastructure with per-request isolation and concurrent support for multiple Lean 4 and Mathlib versions. Access is provided via Python SDK, CLI, web UI, MCP server, and raw HTTP API with no local Lean 4 installation required. The paper states that AXLE has served over 500 million requests and serves as the underlying infrastructure for Axiom Math's proving efforts, including a 12/12 score on the 2025 Putnam competition.

Significance. If the described capabilities and deployment hold, AXLE provides a practical, scalable solution to a recognized bottleneck in AI-for-mathematics research: the need for high-throughput, correct, and isolated Lean 4 interactions without local setup. The public endpoint, multi-version support, and per-request isolation directly address limitations of existing parallel-compilation tools. Credit is due for the explicit public release, the SDK, and the demonstrated downstream use in a high-visibility proving task. These elements position the work as a useful systems contribution that can be immediately adopted by reinforcement-learning pipelines, agentic provers, and dataset curators.

minor comments (2)

[Abstract] Abstract: the assertion of 'feature completeness' and 'strict proof verification' is presented at a high level without enumerating the 14 tools or distinguishing the verification mode from standard Lean kernel checks; a concise table or enumerated list in the main text would improve clarity.
[Abstract] The manuscript supplies no high-level performance indicators (e.g., average latency, error rate under concurrent load, or version-compatibility test coverage) even though the central narrative emphasizes scale and robustness; while external verification is possible via the public endpoint, such indicators would make the paper more self-contained.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their thorough review, positive assessment of AXLE's contributions, and recommendation to accept the manuscript. We are pleased that the work is recognized as addressing a practical bottleneck in AI-for-mathematics tooling through its public release, multi-version support, per-request isolation, and demonstrated high-volume usage.

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper is a systems description of a deployed cloud service providing Lean 4 metaprogramming tools. It contains no derivations, equations, predictions, fitted parameters, or load-bearing self-citations. All central claims (existence of 14 tools, multi-tenant architecture, request volume, Putnam usage) are presented as directly observable facts about a public service rather than results computed from internal inputs that could reduce to those inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is an infrastructure and systems paper describing a deployed service; the abstract contains no mathematical derivations, fitted parameters, background axioms, or postulated entities.

pith-pipeline@v0.9.1-grok · 5772 in / 1207 out tokens · 37308 ms · 2026-06-26T00:24:08.537342+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

27 extracted references · 4 canonical work pages

[1]

Pantograph : A machine-to-machine interaction interface for advanced theorem proving, high-level reasoning, and data extraction in Lean 4

Aniva, L., Sun, C., Miranda, B., Barrett, C., and Koyejo, S. Pantograph : A machine-to-machine interaction interface for advanced theorem proving, high-level reasoning, and data extraction in Lean 4. In Gurfinkel, A. and Heule, M. (eds.), Tools and Algorithms for the Construction and Analysis of Systems (TACAS), pp.\ 104--123. Springer Nature Switzerland,...

work page doi:10.1007/978-3-031-90643-5_6 2025
[2]

Goedel-workbook-lean-4.27: Lean workbook competition-math proofs from DeepSeek-Prover-V1.5 , migrated to Lean 4.27.0 / Mathlib v4.27.0

banach1729 . Goedel-workbook-lean-4.27: Lean workbook competition-math proofs from DeepSeek-Prover-V1.5 , migrated to Lean 4.27.0 / Mathlib v4.27.0. https://huggingface.co/datasets/banach1729/goedel-workbook-lean427, 2026

2026
[3]

Almost all primes are partially regular

Chen, E., Cummins, C., Eltschig, B., et al. Almost all primes are partially regular. arXiv preprint arXiv:2602.05090, 2026 a

arXiv 2026
[4]

Fel's conjecture on syzygies of numerical semigroups

Chen, E., Cummins, C., Grubisic, D., et al. Fel's conjecture on syzygies of numerical semigroups. arXiv preprint arXiv:2602.03716, 2026 b

arXiv 2026
[5]

Chen, L. et al. Seed-Prover : Deep and broad reasoning for automated theorem proving. arXiv preprint arXiv:2507.23726, 2025

arXiv 2025
[6]

doi: 10.1007/978-3-030-79876-5_37

de Moura, L. and Ullrich, S. The Lean 4 theorem prover and programming language. In Automated Deduction (CADE 28), volume 12699 of Lecture Notes in Computer Science, pp.\ 625--635. Springer, 2021. doi:10.1007/978-3-030-79876-5_37

work page doi:10.1007/978-3-030-79876-5_37 2021
[7]

Kimina Lean Server : A high-performance Lean server for large-scale verification

Dos Santos, M., de Sax \'e , H., Wang, H., Wang, R., Bak s ys, M., \"U nsal, M., Liu, J., Liu, Z., and Li, J. Kimina Lean Server : A high-performance Lean server for large-scale verification. In NeurIPS 2025 Workshop on AI for Math, 2025. arXiv preprint arXiv:2504.21230

arXiv 2025
[8]

leanclient: A Python client for interacting with the Lean 4 language server

Dressler, O. leanclient: A Python client for interacting with the Lean 4 language server. https://github.com/oOo0oOo/leanclient, 2025

2025
[9]

jixia: A static analysis tool for Lean 4

FrenzyMath . jixia: A static analysis tool for Lean 4. https://github.com/frenzymath/jixia, 2024

2024
[10]

Safeverify: A Lean 4 script for robustly verifying submitted proofs of theorems and implementations of functions

GasStationManager . Safeverify: A Lean 4 script for robustly verifying submitted proofs of theorems and implementations of functions. https://github.com/GasStationManager/SafeVerify, 2025

2025
[11]

Hubert, T. et al. Olympiad-level formal mathematical reasoning with reinforcement learning. Nature, 2025. doi:10.1038/s41586-025-09833-y

work page doi:10.1038/s41586-025-09833-y 2025
[12]

A read--eval--print loop for Lean 4

Lean community . A read--eval--print loop for Lean 4. https://github.com/leanprover-community/repl, 2023

2023
[13]

lean4checker: Replay the Environment for a given Lean module

Lean FRO . lean4checker: Replay the Environment for a given Lean module. https://github.com/leanprover/lean4checker, 2023. Integrated into Lean 4 v4.28.0 as leanchecker ; repository archived 2026

2023
[14]

Comparator: A trustworthy judge for Lean proofs

Lean FRO . Comparator: A trustworthy judge for Lean proofs. https://github.com/leanprover/comparator, 2025

2025
[15]

Numina-Lean-Agent : An open and general agentic reasoning system for formal mathematics

Liu, J., Zhou, Z., Zhu, Z., Dos Santos, M., He, W., Liu, J., Wang, R., Xie, Y., Zhao, J., Wang, Q., Zhi, L., Li, J., and Li, W. Numina-Lean-Agent : An open and general agentic reasoning system for formal mathematics. arXiv preprint arXiv:2601.14027, 2026

arXiv 2026
[16]

Gauss : An agent for autoformalization

Math Inc. Gauss : An agent for autoformalization. https://www.math.inc/gauss, 2025

2025
[17]

Leanstral : An open-source code agent for Lean 4

Mistral AI . Leanstral : An open-source code agent for Lean 4. https://mistral.ai/news/leanstral, 2026

2026
[18]

APOLLO : Automated LLM and Lean collaboration for advanced formal reasoning

Ospanov, A., Farnia, F., and Yousefzadeh, R. APOLLO : Automated LLM and Lean collaboration for advanced formal reasoning. arXiv preprint arXiv:2505.05758, 2025

arXiv 2025
[19]

LeanInteract : A Python interface for Lean 4

Poiroux, A., Kun c ak, V., and Bosselut, A. LeanInteract : A Python interface for Lean 4. https://github.com/augustepoiroux/LeanInteract, 2025

2025
[20]

Z., Shao, Z., Song, J., Xin, H., Wang, H., Zhao, W., Zhang, L., Fu, Z., Zhu, Q., Yang, D., et al

Ren, Z. Z., Shao, Z., Song, J., Xin, H., Wang, H., Zhao, W., Zhang, L., Fu, Z., Zhu, Q., Yang, D., et al. DeepSeek-Prover-V2 : Advancing formal mathematical reasoning via reinforcement learning for subgoal decomposition. arXiv preprint arXiv:2504.21801, 2025

Pith/arXiv arXiv 2025
[21]

An in-context learning agent for formal theorem-proving

Thakur, A., Tsoukalas, G., Wen, Y., Xin, J., and Chaudhuri, S. An in-context learning agent for formal theorem-proving. In Conference on Language Modeling (COLM), 2024. arXiv preprint arXiv:2310.04353

arXiv 2024
[22]

ProofWala : Multilingual proof data synthesis and theorem-proving

Thakur, A., Tsoukalas, G., Durrett, G., and Chaudhuri, S. ProofWala : Multilingual proof data synthesis and theorem-proving. arXiv preprint arXiv:2502.04671, 2025

Pith/arXiv arXiv 2025
[23]

Aristotle : IMO -level automated theorem proving

The Harmonic Team . Aristotle : IMO -level automated theorem proving. arXiv preprint arXiv:2510.01346, 2025

Pith/arXiv arXiv 2025
[24]

The lean mathematical library

The mathlib Community . The Lean mathematical library. In Proceedings of the 9th ACM SIGPLAN International Conference on Certified Programs and Proofs (CPP), pp.\ 367--381, 2020. doi:10.1145/3372885.3373824

work page doi:10.1145/3372885.3373824 2020
[25]

Hilbert : Recursively building formal proofs with informal reasoning

Varambally, S., Voice, T., Sun, Y., Chen, Z., Yu, R., and Ye, K. Hilbert : Recursively building formal proofs with informal reasoning. arXiv preprint arXiv:2509.22819, 2025

arXiv 2025
[26]

Wang, H. et al. Kimina-Prover Preview : Towards large formal reasoning models with reinforcement learning. arXiv preprint arXiv:2504.11354, 2025

Pith/arXiv arXiv 2025
[27]

LeanDojo : Theorem proving with retrieval-augmented language models

Yang, K., Swope, A., Gu, A., Chalamala, R., Song, P., Yu, S., Godil, S., Prenger, R., and Anandkumar, A. LeanDojo : Theorem proving with retrieval-augmented language models. In Advances in Neural Information Processing Systems (NeurIPS), 2023

2023

[1] [1]

Pantograph : A machine-to-machine interaction interface for advanced theorem proving, high-level reasoning, and data extraction in Lean 4

Aniva, L., Sun, C., Miranda, B., Barrett, C., and Koyejo, S. Pantograph : A machine-to-machine interaction interface for advanced theorem proving, high-level reasoning, and data extraction in Lean 4. In Gurfinkel, A. and Heule, M. (eds.), Tools and Algorithms for the Construction and Analysis of Systems (TACAS), pp.\ 104--123. Springer Nature Switzerland,...

work page doi:10.1007/978-3-031-90643-5_6 2025

[2] [2]

Goedel-workbook-lean-4.27: Lean workbook competition-math proofs from DeepSeek-Prover-V1.5 , migrated to Lean 4.27.0 / Mathlib v4.27.0

banach1729 . Goedel-workbook-lean-4.27: Lean workbook competition-math proofs from DeepSeek-Prover-V1.5 , migrated to Lean 4.27.0 / Mathlib v4.27.0. https://huggingface.co/datasets/banach1729/goedel-workbook-lean427, 2026

2026

[3] [3]

Almost all primes are partially regular

Chen, E., Cummins, C., Eltschig, B., et al. Almost all primes are partially regular. arXiv preprint arXiv:2602.05090, 2026 a

arXiv 2026

[4] [4]

Fel's conjecture on syzygies of numerical semigroups

Chen, E., Cummins, C., Grubisic, D., et al. Fel's conjecture on syzygies of numerical semigroups. arXiv preprint arXiv:2602.03716, 2026 b

arXiv 2026

[5] [5]

Chen, L. et al. Seed-Prover : Deep and broad reasoning for automated theorem proving. arXiv preprint arXiv:2507.23726, 2025

arXiv 2025

[6] [6]

doi: 10.1007/978-3-030-79876-5_37

de Moura, L. and Ullrich, S. The Lean 4 theorem prover and programming language. In Automated Deduction (CADE 28), volume 12699 of Lecture Notes in Computer Science, pp.\ 625--635. Springer, 2021. doi:10.1007/978-3-030-79876-5_37

work page doi:10.1007/978-3-030-79876-5_37 2021

[7] [7]

Kimina Lean Server : A high-performance Lean server for large-scale verification

Dos Santos, M., de Sax \'e , H., Wang, H., Wang, R., Bak s ys, M., \"U nsal, M., Liu, J., Liu, Z., and Li, J. Kimina Lean Server : A high-performance Lean server for large-scale verification. In NeurIPS 2025 Workshop on AI for Math, 2025. arXiv preprint arXiv:2504.21230

arXiv 2025

[8] [8]

leanclient: A Python client for interacting with the Lean 4 language server

Dressler, O. leanclient: A Python client for interacting with the Lean 4 language server. https://github.com/oOo0oOo/leanclient, 2025

2025

[9] [9]

jixia: A static analysis tool for Lean 4

FrenzyMath . jixia: A static analysis tool for Lean 4. https://github.com/frenzymath/jixia, 2024

2024

[10] [10]

Safeverify: A Lean 4 script for robustly verifying submitted proofs of theorems and implementations of functions

GasStationManager . Safeverify: A Lean 4 script for robustly verifying submitted proofs of theorems and implementations of functions. https://github.com/GasStationManager/SafeVerify, 2025

2025

[11] [11]

Hubert, T. et al. Olympiad-level formal mathematical reasoning with reinforcement learning. Nature, 2025. doi:10.1038/s41586-025-09833-y

work page doi:10.1038/s41586-025-09833-y 2025

[12] [12]

A read--eval--print loop for Lean 4

Lean community . A read--eval--print loop for Lean 4. https://github.com/leanprover-community/repl, 2023

2023

[13] [13]

lean4checker: Replay the Environment for a given Lean module

Lean FRO . lean4checker: Replay the Environment for a given Lean module. https://github.com/leanprover/lean4checker, 2023. Integrated into Lean 4 v4.28.0 as leanchecker ; repository archived 2026

2023

[14] [14]

Comparator: A trustworthy judge for Lean proofs

Lean FRO . Comparator: A trustworthy judge for Lean proofs. https://github.com/leanprover/comparator, 2025

2025

[15] [15]

Numina-Lean-Agent : An open and general agentic reasoning system for formal mathematics

Liu, J., Zhou, Z., Zhu, Z., Dos Santos, M., He, W., Liu, J., Wang, R., Xie, Y., Zhao, J., Wang, Q., Zhi, L., Li, J., and Li, W. Numina-Lean-Agent : An open and general agentic reasoning system for formal mathematics. arXiv preprint arXiv:2601.14027, 2026

arXiv 2026

[16] [16]

Gauss : An agent for autoformalization

Math Inc. Gauss : An agent for autoformalization. https://www.math.inc/gauss, 2025

2025

[17] [17]

Leanstral : An open-source code agent for Lean 4

Mistral AI . Leanstral : An open-source code agent for Lean 4. https://mistral.ai/news/leanstral, 2026

2026

[18] [18]

APOLLO : Automated LLM and Lean collaboration for advanced formal reasoning

Ospanov, A., Farnia, F., and Yousefzadeh, R. APOLLO : Automated LLM and Lean collaboration for advanced formal reasoning. arXiv preprint arXiv:2505.05758, 2025

arXiv 2025

[19] [19]

LeanInteract : A Python interface for Lean 4

Poiroux, A., Kun c ak, V., and Bosselut, A. LeanInteract : A Python interface for Lean 4. https://github.com/augustepoiroux/LeanInteract, 2025

2025

[20] [20]

Z., Shao, Z., Song, J., Xin, H., Wang, H., Zhao, W., Zhang, L., Fu, Z., Zhu, Q., Yang, D., et al

Ren, Z. Z., Shao, Z., Song, J., Xin, H., Wang, H., Zhao, W., Zhang, L., Fu, Z., Zhu, Q., Yang, D., et al. DeepSeek-Prover-V2 : Advancing formal mathematical reasoning via reinforcement learning for subgoal decomposition. arXiv preprint arXiv:2504.21801, 2025

Pith/arXiv arXiv 2025

[21] [21]

An in-context learning agent for formal theorem-proving

Thakur, A., Tsoukalas, G., Wen, Y., Xin, J., and Chaudhuri, S. An in-context learning agent for formal theorem-proving. In Conference on Language Modeling (COLM), 2024. arXiv preprint arXiv:2310.04353

arXiv 2024

[22] [22]

ProofWala : Multilingual proof data synthesis and theorem-proving

Thakur, A., Tsoukalas, G., Durrett, G., and Chaudhuri, S. ProofWala : Multilingual proof data synthesis and theorem-proving. arXiv preprint arXiv:2502.04671, 2025

Pith/arXiv arXiv 2025

[23] [23]

Aristotle : IMO -level automated theorem proving

The Harmonic Team . Aristotle : IMO -level automated theorem proving. arXiv preprint arXiv:2510.01346, 2025

Pith/arXiv arXiv 2025

[24] [24]

The lean mathematical library

The mathlib Community . The Lean mathematical library. In Proceedings of the 9th ACM SIGPLAN International Conference on Certified Programs and Proofs (CPP), pp.\ 367--381, 2020. doi:10.1145/3372885.3373824

work page doi:10.1145/3372885.3373824 2020

[25] [25]

Hilbert : Recursively building formal proofs with informal reasoning

Varambally, S., Voice, T., Sun, Y., Chen, Z., Yu, R., and Ye, K. Hilbert : Recursively building formal proofs with informal reasoning. arXiv preprint arXiv:2509.22819, 2025

arXiv 2025

[26] [26]

Wang, H. et al. Kimina-Prover Preview : Towards large formal reasoning models with reinforcement learning. arXiv preprint arXiv:2504.11354, 2025

Pith/arXiv arXiv 2025

[27] [27]

LeanDojo : Theorem proving with retrieval-augmented language models

Yang, K., Swope, A., Gu, A., Chalamala, R., Song, P., Yu, S., Godil, S., Prenger, R., and Anandkumar, A. LeanDojo : Theorem proving with retrieval-augmented language models. In Advances in Neural Information Processing Systems (NeurIPS), 2023

2023