pith. sign in

LLMs Cannot Reliably Identify and Reason About Security Vulnerabilities (Yet?): A Comprehensive Evaluation, Framework, and Benchmarks

3 Pith papers cite this work, alongside 80 external citations. Polarity classification is still indexing.

3 Pith papers citing it
80 external citations · external index

fields

cs.SE 2 cs.CR 1

years

2026 3

verdicts

UNVERDICTED 3

representative citing papers

VulWeaver: Weaving Broken Semantics for Grounded Vulnerability Detection

cs.SE · 2026-04-12 · unverdicted · novelty 5.0

VulWeaver improves Java vulnerability detection to 0.75 F1 by enhancing dependency graphs with LLM semantic fixes, extracting full context from slices plus implicit usage info, and applying type-specific meta-prompting with majority voting.

citing papers explorer

Showing 3 of 3 citing papers.