EconCSLib: AI-Assisted Lean Formalization for Economics & Computation research

Nikhil Garg

arxiv: 2606.13306 · v2 · pith:KFUD4QE7new · submitted 2026-06-11 · 💻 cs.GT

Pith reviewed 2026-06-27 05:09 UTC · model grok-4.3

classification 💻 cs.GT

keywords Lean formalization · Economics and Computation · AI-assisted verification · proof assistants · auctions · matching markets · formal methods

The pith

A human-AI-Lean workflow lets researchers formalize their Economics and Computation papers.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents EconCSLib, a Lean 4 library and workflow for formalizing research papers in Economics and Computation with language-model assistance. The central design principle is a human-AI-Lean workflow: an LLM writes Lean code, Lean checks formal statements and proofs, and humans verify the translation boundary from paper claims to formal statements. The library is organized around research papers, preserving their formal statements and proof structures while elevating reusable components into shared infrastructure for probability, auctions, matching markets, and graphs. This setup is author-facing so that researchers can formalize their own publications and contribute back to the library through validation reports and dependency graphs. The current repository already holds 11 fully formalized papers and 3 partially formalized ones.

Core claim

EconCSLib organizes formalizations around individual research papers, preserving their statements and proof structures to the extent possible while moving reusable mathematical statements into shared infrastructure. The workflow combines LLM code generation, Lean verification of statements and proofs, and human oversight specifically at the boundary where paper claims are translated into formal statements. The library is designed to be author-facing, with tools such as post-formalization validation reports, paper result dependency graphs, and a review dashboard to support researchers formalizing their own work.

What carries the argument

The human-AI-Lean workflow in which an LLM writes Lean code, Lean checks formal statements and proofs, and humans verify the translation boundary from paper claims to formal statements.
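
As a minimal sketch of what the translation boundary looks like, assume a toy single-item second-price auction. This model is not taken from EconCSLib or from any of the formalized papers, and `utility` and `truthful_weakly_dominant` are hypothetical names. Lean checks the proof, but only a human reader can confirm that the definition (for example, the rule that ties lose) matches what the paper intends.

```lean
-- Hypothetical toy model for illustration; not EconCSLib code.
-- A bidder with value `v` bids `b` against a highest competing bid `p`.

/-- Utility in a second-price auction: the bidder wins iff `b > p`
    (ties lose, an assumed tie-breaking rule) and then pays `p`. -/
def utility (v b p : Int) : Int :=
  if p < b then v - p else 0

/-- Paper-facing claim: bidding the true value `v` is weakly dominant,
    i.e. no bid `b` does better than `v` against any competing bid `p`. -/
theorem truthful_weakly_dominant (v b p : Int) :
    utility v b p ≤ utility v v p := by
  unfold utility
  -- Case-split on both `if` conditions, then close each case by linear arithmetic.
  split <;> split <;> omega
```

The proof is two lines, so the reviewer's attention goes where the workflow says it should: to whether `utility` and the theorem statement say what the paper says.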

If this is right

Researchers can formalize their own papers, inspect the Lean translations of paper-facing statements, and contribute reusable components back to the library.
Initial shared libraries for probability, auctions, matching markets, and graph tools become available for reuse across papers.
Post-formalization validation reports, paper result dependency graphs, and a review dashboard assist the formalization process.
The library grows through author contributions of both paper-specific formalizations and reusable infrastructure.
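
The split between shared infrastructure and paper-specific results might look roughly like the sketch below. The namespaces `SharedSketch` and `PaperSketch` and both theorem names are assumptions made for illustration; they do not reflect EconCSLib's actual module layout.

```lean
-- Hypothetical layout for illustration; not EconCSLib's real namespaces.

namespace SharedSketch

/-- Reusable fact: paying at most one's value leaves nonnegative surplus. -/
theorem surplus_nonneg (v p : Int) (h : p ≤ v) : 0 ≤ v - p := by
  omega

end SharedSketch

namespace PaperSketch

/-- Paper-specific result: a winner who bids at most the value `v`
    and pays at most the bid `b` has nonnegative utility.
    The proof depends on the shared lemma above. -/
theorem winner_utility_nonneg (v b p : Int)
    (hbid : b ≤ v) (hpay : p ≤ b) : 0 ≤ v - p :=
  SharedSketch.surplus_nonneg v p (by omega)

end PaperSketch
```

An edge from `winner_utility_nonneg` to `surplus_nonneg` is the kind of relationship a paper result dependency graph would record.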

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Formal statements produced this way could serve as stable reference points when comparing models across different papers.
The same workflow structure might apply to formalization efforts in other applied fields that rely on similar combinations of probability and optimization.
If LLM translation accuracy improves, the human review step could eventually focus on fewer, higher-level checks.

Load-bearing premise

Humans can reliably detect when an LLM-generated Lean statement fails to capture the original paper's intended meaning or introduces subtle errors in economic modeling assumptions.

What would settle it

A case in which an LLM-generated formalization passes both Lean checks and human review, yet later independent analysis shows that a key modeling assumption from the source paper was altered or omitted.
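
One way such a case could arise is a vacuously true statement. The sketch below is a hypothetical example, not drawn from the paper or the repository: an extra hypothesis that conflicts with the intended one makes the assumptions unsatisfiable, so Lean accepts a conclusion the source paper would never claim.

```lean
-- Hypothetical example for illustration; names are not from EconCSLib.

/-- Faithful reading of an assumed claim:
    "a bidder who pays at most their value has nonnegative surplus." -/
theorem faithful (v p : Int) (hpay : p ≤ v) : 0 ≤ v - p := by
  omega

/-- Altered reading: the extra hypothesis `hbad` contradicts `hpay`,
    so no `v` and `p` satisfy both. Lean still accepts the theorem,
    including the implausible conclusion that surplus is at least 100. -/
theorem altered (v p : Int) (hpay : p ≤ v) (hbad : v < p) :
    100 ≤ v - p := by
  omega
```

Both theorems type-check, so only a reading of the hypotheses against the source text distinguishes them.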

Figures

Figures reproduced from arXiv: 2606.13306 by Nikhil Garg.

**Figure 2.** Screenshot of the paper-interface review dashboard for [ [PITH_FULL_IMAGE:figures/full_fig_p006_2.png]

**Figure 3.** Full dependency DAG for [GHW01], Competitive Auctions and Digital Goods. The DAG shows the SODA paper. Section 8.2 is checked against the later journal version's monotone-auction formulation of the revenue upper bound; the preliminary unrestricted wording is tracked only as source-version provenance. [PITH_FULL_IMAGE:figures/full_fig_p036_3.png]

Original abstract

This paper presents EconCSLib, a Lean 4 library and workflow for formalizing research papers in applied modeling fields such as Economics and Computation, with language-model assistance. The goal of EconCSLib is to enable researchers to formalize their papers in Lean without knowing Lean themselves. The central design principle is a human-AI-Lean formalization workflow: an LLM writes Lean code, Lean checks formal statements and proofs, and both humans and LLM-as-judge processes can verify that the paper's statements were translated into Lean correctly. We develop agent skills, human-facing reporting, a review dashboard, and auditing procedures to support this workflow. The current public repository contains 20 formalized papers and 4 partially formalized papers, along with shared libraries for probability (including stochastic processes), auctions, matching markets, social choice, and graph tools, totaling 986,391 lines of Lean code. To our knowledge, we are also among the first applied math researchers to systematically pursue Lean formalization of one's own publications in the process of building such a community library. We welcome users and contributors to the project. The library and workflow are available at https://github.com/nikhgarg/EconCSLib, with corresponding project webpage at https://gargnikhil.com/EconCSLib/.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note: private letter to a colleague

EconCSLib ships a real Lean library with 11 formalized EconCS papers and a workable author-facing workflow, but offers almost no data on whether the formalizations match the source papers.

The paper's main contribution is the EconCSLib repository itself: a Lean 4 setup that organizes formalizations around individual research papers, lifts reusable pieces into shared modules for probability, auctions, matching, and graphs, and includes dependency graphs plus validation reports. Eleven papers are done and the repo is public, which is concrete output rather than a plan.

The workflow description is straightforward: LLM drafts Lean code, Lean type-checks it, and a human checks the translation from paper claim to formal statement. That design is internally consistent and author-oriented, which is a reasonable way to grow a library in a small community.

The clear gap is evidence. The text gives counts of formalized papers but no error rates, no examples of caught mismatches, and no discussion of how often the human step actually fixes modeling subtleties. Without that, readers cannot tell how reliable the process is or whether it will scale past the current set. The paper is a tool announcement, not an evaluation of formalization success.

This is useful for the small group already working on Lean in EconCS or formal methods in applied math. A reader outside that niche will get little from it. The artifact is new enough and the process is described clearly enough that it belongs in peer review so the community can inspect the code and decide whether to adopt or extend it.

Referee Report

1 major / 2 minor

Summary. The paper describes EconCSLib, a Lean 4 library and associated workflow for formalizing papers in Economics and Computation using AI assistance. The workflow consists of an LLM generating Lean code, Lean verifying the statements and proofs, and humans verifying that the formal statements accurately reflect the original paper claims. The library is structured around individual papers with reusable components extracted to shared libraries. It currently includes 11 formalized papers and 3 partially formalized ones, with libraries for probability, auctions, matching markets, and graph tools. The project is open source and author-facing, with tools for validation reports and dependency graphs.

Significance. If effective, this work could significantly advance the formal verification of results in the Economics and Computation community by providing machine-checked proofs and reusable formal statements. The concrete achievement of formalizing 11 papers using the described workflow, along with the public repository, demonstrates feasibility and provides a foundation for community contributions. This is particularly valuable as it involves researchers formalizing their own publications, which helps ensure fidelity to the original claims.

major comments (1)

[Abstract] The abstract reports the number of formalized papers (11) and partially formalized (3) but does not include any metrics or evidence on the correctness of these formalizations, such as the proportion of theorems formalized per paper or any reported discrepancies found during human verification. This information is load-bearing for assessing whether the workflow reliably captures paper claims.

minor comments (2)

Consider adding a brief example in the main text of a paper statement and its Lean formalization to illustrate the translation boundary verification step.
[Abstract] The phrase 'to our knowledge, we are also among the first' could be clarified with more context on prior efforts in formalizing applied math papers.

Simulated Author's Rebuttal

1 response · 0 unresolved

We thank the referee for the positive evaluation and the constructive comment on the abstract. We address it directly below.

Referee: [Abstract] The abstract reports the number of formalized papers (11) and partially formalized (3) but does not include any metrics or evidence on the correctness of these formalizations, such as the proportion of theorems formalized per paper or any reported discrepancies found during human verification. This information is load-bearing for assessing whether the workflow reliably captures paper claims.

Authors: We agree that the abstract would be strengthened by additional context on verification. The described workflow includes an explicit human verification step (assisted by LLM) to confirm that formal statements accurately reflect the original paper claims, with any discrepancies resolved before inclusion. However, the library does not collect or report quantitative per-paper metrics such as 'proportion of theorems formalized' because formalization targets key results and reusable components rather than exhaustive coverage of every theorem in a paper; the structure is paper-centric by design. In the revised version we will update the abstract to note that every formalized paper has undergone human verification for fidelity to the source claims. Detailed per-paper validation reports and dependency information remain available in the public repository.

Revision: yes

Circularity Check

0 steps flagged

No significant circularity

The paper is a descriptive account of a Lean library and human-AI workflow for formalizing economics papers. It contains no derivations, equations, predictions, fitted parameters, or load-bearing claims that could reduce to self-definition, self-citation chains, or renaming of known results. The central content is a process description plus concrete deliverables (11 formalized papers, shared libraries), which are externally verifiable outputs rather than internally circular reasoning. No steps meet the criteria for circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is a software library and workflow description paper with no mathematical derivations, fitted parameters, or postulated entities.

pith-pipeline@v0.9.1-grok · 5777 in / 1029 out tokens · 18837 ms · 2026-06-27T05:09:26.112810+00:00 · methodology
