Digital Technical Journal

Differential testing for software

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Sketch-and-Verify: Structured Inference-Time Scaling via Program Sketching

cs.LG · 2026-05-09 · conditional · novelty 7.0

Sketch-and-Verify improves small-LLM code generation on HumanEval+ by factorizing search into K algorithmic sketches and M fillings each, outperforming flat sampling by up to 32 percentage points at matched budget while remaining cheaper than upgrading model tier.

Semantic Voting: Execution-Grounded Consensus for LLM Code Generation

cs.SE · 2026-05-09 · unverdicted · novelty 6.0

Execution-based selectors for LLM code candidates outperform textual voting by large margins across configurations, with input generation quality mattering more than the specific aggregation rule.

citing papers explorer

Sketch-and-Verify: Structured Inference-Time Scaling via Program Sketching

cs.LG · 2026-05-09 · conditional · none · ref 26

Sketch-and-Verify improves small-LLM code generation on HumanEval+ by factorizing search into K algorithmic sketches and M fillings each, outperforming flat sampling by up to 32 percentage points at matched budget while remaining cheaper than upgrading model tier.
