Raghu Reddy

Brahma Reddy Korraprolu, Pavitra Pinninti · 2025 · arXiv 7383.371738

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Sakura: An Approach for Generating Complex Tests from Natural Language Test Descriptions

cs.SE · 2026-05-30 · unverdicted · novelty 7.0

Sakura is a multi-agent system that generates structurally complex tests from NL descriptions, achieving 50-78% higher compilability and 38-66% higher coverage overlap than baselines on 1,464 scenarios from 20 Apache Commons applications.

ContractEval: A Benchmark for Evaluating Contract-Satisfying Assertions in Code Generation

cs.AI · 2025-10-14 · unverdicted · novelty 7.0

ContractEval benchmark on 364 tasks shows code LLMs achieve 75-82% functional pass@1 but 0% contract satisfaction under standard prompting, rising only to 23-41% with explicit contracts.

citing papers explorer

Showing 2 of 2 citing papers.

Sakura: An Approach for Generating Complex Tests from Natural Language Test Descriptions cs.SE · 2026-05-30 · unverdicted · none · ref 45
Sakura is a multi-agent system that generates structurally complex tests from NL descriptions, achieving 50-78% higher compilability and 38-66% higher coverage overlap than baselines on 1,464 scenarios from 20 Apache Commons applications.
ContractEval: A Benchmark for Evaluating Contract-Satisfying Assertions in Code Generation cs.AI · 2025-10-14 · unverdicted · none · ref 17
ContractEval benchmark on 364 tasks shows code LLMs achieve 75-82% functional pass@1 but 0% contract satisfaction under standard prompting, rising only to 23-41% with explicit contracts.

Raghu Reddy

fields

years

verdicts

representative citing papers

citing papers explorer