CrypFormBench is a new benchmark jointly covering symbolic and computational security to evaluate LLMs on five formal analysis capabilities, with results showing top model Claude-3.5 scores 48.7/100 and most models struggling on generation, transformation, and correction.
IEEE Transactions on Computational Social Sys- tems10(3), 1039–1056 (2023) https://doi.or g/10.1109/TCSS.2022.3162869
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
years
2026 3verdicts
UNVERDICTED 3representative citing papers
Evidence-based taxonomy of security properties with first-order logic definitions and ProVerif/Tamarin executable examples derived from a 2022-2025 literature review of 53 studies.
Pretraining on broad sound events plus on-the-fly augmentations improves out-of-domain true-positive rates for acoustic drone detection at fixed low false-positive rates.
citing papers explorer
-
Improving acoustic drone detection generalization through pretraining and data augmentation
Pretraining on broad sound events plus on-the-fly augmentations improves out-of-domain true-positive rates for acoustic drone detection at fixed low false-positive rates.