- Correct Code, Vulnerable Dependencies: A Large Scale Measurement Study of LLM-Specified Library Versions
  LLMs frequently specify library versions with known CVEs in generated code (36-56% of tasks), show low compatibility (20-63%), and converge on the same risky versions across models.
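
  As a minimal sketch of the measurement idea (not the paper's actual pipeline), the core check reduces to testing whether a pinned version falls inside a known-vulnerable range. The advisory data below is hypothetical; a real study would query a vulnerability database such as OSV.

  ```python
  # Sketch: does an LLM-pinned library version fall inside a
  # known-vulnerable [introduced, fixed) range? Advisory data is made up.
  from dataclasses import dataclass

  def parse_version(v: str) -> tuple:
      """Turn '2.25.1' into (2, 25, 1) for ordered comparison."""
      return tuple(int(part) for part in v.split("."))

  @dataclass
  class Advisory:
      package: str
      introduced: str   # first affected version
      fixed: str        # first patched version

  def is_vulnerable(package: str, pinned: str, advisories: list) -> bool:
      """True if the pinned version lies in any [introduced, fixed) range."""
      v = parse_version(pinned)
      return any(
          a.package == package
          and parse_version(a.introduced) <= v < parse_version(a.fixed)
          for a in advisories
      )

  # Hypothetical advisory: versions >=2.3.0 and <2.31.0 affected.
  advisories = [Advisory("requests", "2.3.0", "2.31.0")]
  print(is_vulnerable("requests", "2.25.1", advisories))  # True: inside range
  print(is_vulnerable("requests", "2.31.0", advisories))  # False: first fixed
  ```

  Tuple comparison keeps the version ordering correct where naive string comparison would not (e.g. "2.9" vs "2.10").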
- SiblingRepair: Sibling-Based Multi-Hunk Repair with Large Language Models
  SiblingRepair uses LLMs with semantic sibling detection and simultaneous/iterative repair strategies to outperform prior multi-hunk APR tools like Hercules on Defects4J and GHRB benchmarks.
- Understanding Bugs in Template Engine-Based Applications: Symptoms, Root Causes, and Fix Patterns
  An empirical study of 1,004 bugs in template engine-based applications finds abnormal rendering results as the most common symptom (48.61%) and documents 17 root causes with fix patterns that often involve host-side logic changes.
- PuzzleMark: Implicit Jigsaw Learning for Robust Code Dataset Watermarking in Neural Code Completion Models
  PuzzleMark provides a robust and imperceptible watermarking method for code datasets using adaptive variable name concatenation and statistical verification, achieving perfect detection rates with minimal performance impact.
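
  PuzzleMark's exact scheme is not reproduced here, but the statistical-verification idea can be sketched as follows: if watermarked training samples bias variable names toward concatenations of a secret word list, a suspect model's outputs can be tested for an elevated rate of such names. The secret vocabulary, background rate, and threshold below are all illustrative assumptions, not the paper's parameters.

  ```python
  # Toy watermark verification: count identifiers built entirely from a
  # secret word list and compare their frequency against an assumed
  # background rate via a z-score. Vocabulary and rates are hypothetical.
  import math
  import re

  SECRET_PARTS = {"buffer", "index", "cursor", "slot"}  # assumed secret vocabulary

  def is_marked(identifier: str) -> bool:
      """An identifier counts as marked if every '_'-joined part is secret."""
      parts = identifier.split("_")
      return len(parts) >= 2 and all(p in SECRET_PARTS for p in parts)

  def watermark_score(snippets, background_rate=0.01):
      """Z-score of marked-identifier frequency vs. an assumed background rate."""
      idents = [w for s in snippets for w in re.findall(r"[a-z_]+", s)]
      hits = sum(is_marked(w) for w in idents)
      n = len(idents)
      expected = n * background_rate
      return (hits - expected) / math.sqrt(n * background_rate * (1 - background_rate))

  suspect = ["buffer_index = 0", "cursor_slot = read(buffer_index)"]
  print(watermark_score(suspect) > 3)  # a large z-score flags the dataset
  ```

  A real verifier would need many more samples for the normal approximation to hold; the point is only that detection reduces to a one-sided significance test.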
- When Prompt Under-Specification Improves Code Correctness: An Exploratory Study of Prompt Wording and Structure Effects on LLM-Based Code Generation
  Structurally rich task descriptions make LLMs robust to prompt under-specification, and under-specification can enhance code correctness by disrupting misleading lexical or structural cues.
- Defective Task Descriptions in LLM-Based Code Generation: Detection and Analysis
  SpecValidator detects lexical vagueness, under-specification, and syntax-formatting defects in LLM code-generation prompts with an F1 of 0.804, outperforming GPT-5-mini and Claude Sonnet 4, and shows that under-specification is the most damaging defect type while richer benchmarks are more resilient.
- Exploring the Effectiveness of Abstract Syntax Tree Patterns for Algorithm Recognition
  An AST pattern-matching prototype with a custom DSL achieves 0.74 average F1-score on a BigCloneEval subset, outperforming CodeLlama (0.35) and code clone detectors (best recall 0.20).
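
  The paper's custom DSL is not reproduced here, but a toy analogue using Python's `ast` module shows the general idea: recognize an algorithm by matching a structural pattern rather than surface text. Below, the (assumed, simplistic) pattern is a for-loop that accumulates with `+=`, a crude signature of summation-style algorithms.

  ```python
  # Toy AST pattern matcher: flag code containing a for-loop whose body
  # performs an augmented `+=` assignment (an accumulation pattern).
  import ast

  def matches_accumulation(source: str) -> bool:
      """True if any for-loop in the source accumulates via `+=`."""
      tree = ast.parse(source)
      for node in ast.walk(tree):
          if isinstance(node, ast.For):
              for stmt in ast.walk(node):
                  if isinstance(stmt, ast.AugAssign) and isinstance(stmt.op, ast.Add):
                      return True
      return False

  summation = "total = 0\nfor x in xs:\n    total += x\n"
  lookup = "for x in xs:\n    if x == target:\n        found = True\n"
  print(matches_accumulation(summation))  # True
  print(matches_accumulation(lookup))     # False
  ```

  Because the match is structural, renaming `total` or `x` does not defeat it, which is the property that lets AST patterns beat token-level clone detectors on recall.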
- On the Effectiveness of Modular Testing in EvoSuite
  Emote enhances EvoSuite by allowing non-target setup calls in modular tests and refocusing the fitness function on the target call chain, delivering 15.15% higher target method coverage on an SF100 subset.
- eDySec: A Deep Learning-based Explainable Dynamic Analysis Framework for Detecting Malicious Packages in PyPI Ecosystem
  eDySec is a deep learning-based framework that detects malicious PyPI packages through dynamic analysis, halving feature dimensionality, reducing false positives by 82% and false negatives by 79%, and boosting accuracy by 3% with near-perfect stability.
- Vulnerability Identification by Harnessing Inter-connected Multi-Source Information
  VPFinder integrates multi-source semantic information using multi-head attention to achieve 0.941 F1-score in vulnerability identification and 0.610 F1-score in type classification, outperforming prior approaches.
- FixV2W: Correcting Invalid CVE-CWE Mappings with Knowledge Graph Embeddings
  FixV2W uses knowledge graph embeddings plus longitudinal patterns to fix invalid CVE-CWE mappings, correctly predicting the right CWE for 69% of exploited cases in top-10 rankings and raising ML model MRR from 0.174 to 0.608.
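
  For reference, mean reciprocal rank (MRR), the metric reported as rising from 0.174 to 0.608, averages the reciprocal of the rank at which the correct answer first appears across queries. The ranks below are made-up illustrations.

  ```python
  # Mean reciprocal rank: average of 1/rank of the first correct item per
  # query. Higher is better; a system that always ranks the right CWE
  # first scores 1.0.
  def mrr(ranks):
      """ranks: 1-based rank of the correct answer for each query."""
      return sum(1.0 / r for r in ranks) / len(ranks)

  # Hypothetical ranks for three CVE-to-CWE queries.
  print(round(mrr([1, 2, 10]), 3))  # (1 + 0.5 + 0.1) / 3 -> 0.533
  ```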