hub Canonical reference

Test-driven development and LLM-based code generation

· 2024 · arXiv 1620.369552

Canonical reference. 100% of citing Pith papers cite this work as background.

10 Pith papers citing it

Background 100% of classified citations

citation-role summary

background 5

citation-polarity summary

background 5

representative citing papers

From Runnable to Shippable: Multi-Agent Test-Driven Development for Generating Full-Stack Web Applications from Requirements

cs.SE · 2026-05-17 · unverdicted · novelty 7.0

TDDev automates the full TDD loop for web app generation from requirements, delivering 34-48 percentage point quality gains and zero manual intervention in user studies.

SkillOps: Managing LLM Agent Skill Libraries as Self-Maintaining Software Ecosystems

cs.SE · 2026-05-13 · unverdicted · novelty 7.0

SkillOps maintains LLM skill libraries via Skill Contracts and ecosystem graphs, raising ALFWorld task success to 79.5% as a standalone agent and improving retrieval baselines by up to 2.9 points with near-zero library-time LLM cost.

A Learning Method for Symbolic Systems Using Large Language Models

cs.SE · 2026-05-09 · unverdicted · novelty 7.0

LLM2Ltac mines symbolic tactics from 11,725 Coq theorems using LLMs and integrates them into CoqHammer, improving proof rates by 23.87% on 6,199 theorems from four large verification projects.

Combined Program Analysis Techniques: A Systematic Mapping Study

cs.SE · 2026-05-19 · unverdicted · novelty 6.0

A systematic mapping study of 248 papers introduces a taxonomy of synergistic effects, inter-analysis workflows, and mapping functions to catalog patterns in combined program analysis techniques.

Guiding LLM-based Loop Invariant Synthesis via Feedback on Local Reasoning Errors

cs.PL · 2026-05-18 · unverdicted · novelty 6.0

LORIS detects local reasoning errors in LLM-generated proofs for loop invariants by translating natural-language steps to first-order logic implications and using invalid implications to refine the invariants, achieving 93.1% success on 460 C programs.

Babbling Suppression: Making LLMs Greener One Token at a Time

cs.SE · 2026-04-08 · unverdicted · novelty 6.0

Babbling Suppression stops LLM code generation upon test passage to reduce token output and energy consumption by up to 65% across Python and Java benchmarks.

Boosting Automatic Java-to-Cangjie Translation with Multi-Stage LLM Training and Error Repair

cs.SE · 2026-05-08 · unverdicted · novelty 5.0

Multi-stage LLM training plus compiler-guided error repair boosts functional equivalence in Java-to-Cangjie translation by 6.06% over prior methods despite scarce parallel data.

Walking the Tightrope of LLMs for Software Development: A Practitioners' Perspective

cs.SE · 2025-11-09 · conditional · novelty 5.0

Qualitative interview study with 22 practitioners identifies multi-level benefits, challenges, and mitigation strategies for using LLMs in software development.

Evaluating LLM-Generated ACSL Annotations for Formal Verification

cs.SE · 2026-02-14 · unverdicted · novelty 4.0

Rule-based annotation generation for ACSL outperforms LLM-based methods in achieving successful formal verification of C programs.

Search-Based Software Engineering and AI Foundation Models: Current Landscape and Future Roadmap

cs.SE · 2025-05-26 · unverdicted · novelty 4.0

A research roadmap analyzing the current state of search-based software engineering with foundation models, outlining challenges and directions across three integration aspects.

citing papers explorer

Showing 10 of 10 citing papers.

From Runnable to Shippable: Multi-Agent Test-Driven Development for Generating Full-Stack Web Applications from Requirements

cs.SE · 2026-05-17 · unverdicted · none · ref 30

TDDev automates the full TDD loop for web app generation from requirements, delivering 34-48 percentage point quality gains and zero manual intervention in user studies.

SkillOps: Managing LLM Agent Skill Libraries as Self-Maintaining Software Ecosystems

cs.SE · 2026-05-13 · unverdicted · none · ref 36

SkillOps maintains LLM skill libraries via Skill Contracts and ecosystem graphs, raising ALFWorld task success to 79.5% as a standalone agent and improving retrieval baselines by up to 2.9 points with near-zero library-time LLM cost.

A Learning Method for Symbolic Systems Using Large Language Models

cs.SE · 2026-05-09 · unverdicted · none · ref 28

LLM2Ltac mines symbolic tactics from 11,725 Coq theorems using LLMs and integrates them into CoqHammer, improving proof rates by 23.87% on 6,199 theorems from four large verification projects.

Combined Program Analysis Techniques: A Systematic Mapping Study

cs.SE · 2026-05-19 · unverdicted · none · ref 101

A systematic mapping study of 248 papers introduces a taxonomy of synergistic effects, inter-analysis workflows, and mapping functions to catalog patterns in combined program analysis techniques.

Guiding LLM-based Loop Invariant Synthesis via Feedback on Local Reasoning Errors

cs.PL · 2026-05-18 · unverdicted · none · ref 23

LORIS detects local reasoning errors in LLM-generated proofs for loop invariants by translating natural-language steps to first-order logic implications and using invalid implications to refine the invariants, achieving 93.1% success on 460 C programs.

Babbling Suppression: Making LLMs Greener One Token at a Time

cs.SE · 2026-04-08 · unverdicted · none · ref 25

Babbling Suppression stops LLM code generation upon test passage to reduce token output and energy consumption by up to 65% across Python and Java benchmarks.

Boosting Automatic Java-to-Cangjie Translation with Multi-Stage LLM Training and Error Repair

cs.SE · 2026-05-08 · unverdicted · none · ref 23

Multi-stage LLM training plus compiler-guided error repair boosts functional equivalence in Java-to-Cangjie translation by 6.06% over prior methods despite scarce parallel data.

Walking the Tightrope of LLMs for Software Development: A Practitioners' Perspective

cs.SE · 2025-11-09 · conditional · none · ref 41

Qualitative interview study with 22 practitioners identifies multi-level benefits, challenges, and mitigation strategies for using LLMs in software development.

Evaluating LLM-Generated ACSL Annotations for Formal Verification

cs.SE · 2026-02-14 · unverdicted · none · ref 13

Rule-based annotation generation for ACSL outperforms LLM-based methods in achieving successful formal verification of C programs.

Search-Based Software Engineering and AI Foundation Models: Current Landscape and Future Roadmap

cs.SE · 2025-05-26 · unverdicted · none · ref 196

A research roadmap analyzing the current state of search-based software engineering with foundation models, outlining challenges and directions across three integration aspects.
