In CRYPTO (LNCS, Vol
2 Pith papers cite this work.
Fields: cs.CR (2)
Years: 2026 (2)
Citing papers explorer
- CrypFormBench: Benchmarking Formal Analysis Capability of Large Language Models for Cryptographic Schemes
  CrypFormBench is a new benchmark that jointly covers symbolic and computational security and evaluates LLMs on five formal analysis capabilities. The top model, Claude-3.5, scores 48.7/100, and most models struggle on generation, transformation, and correction.
- GCD: Garbled, Corrected, Demonstrandum -- Fixing and Proving Go's Extended GCD Implementation
  The paper fixes two bugs in Go's extendedGCD for RSA key generation, uses Gobra with Lean lemmata to prove that the corrected version is correct and terminates, and reports a 24% speedup.