Title resolution pending

Yin, D · 2025 · arXiv 2503.04772

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

TheoremBench: Evaluating LLMs on Theorem Proving in Formal Mathematics

cs.AI · 2026-06-08 · unverdicted · novelty 8.0

TheoremBench is a Lean4 benchmark of classical theorems in main and premised forms that evaluates LLM provers on partial progress, coverage, and token efficiency rather than binary success on competition problems.

citing papers explorer

Showing 1 of 1 citing paper after filters.

TheoremBench: Evaluating LLMs on Theorem Proving in Formal Mathematics cs.AI · 2026-06-08 · unverdicted · none · ref 40
TheoremBench is a Lean4 benchmark of classical theorems in main and premised forms that evaluates LLM provers on partial progress, coverage, and token efficiency rather than binary success on competition problems.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer