Baseline reference

Program synthesis with large language models

Jacob Austin, Augustus Odena, Maxwell Nye, Maarten Bosma, Henryk Michalewski, David Dohan, Ellen Jiang, Carrie Cai, Michael Terry, Quoc Le, Charles Sutton · 2021

60% of classified citations in Pith papers use this work as a benchmark or comparison.

Cited by 7 Pith papers.

citation-role summary

dataset: 3 · background: 2

citation-polarity summary

use dataset: 3 · background: 2

representative citing papers

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

cs.CL · 2023-10-10 · unverdicted · novelty 8.0

SWE-bench reveals that even top language models like Claude 2 resolve only 1.96% of 2,294 real-world GitHub issues, highlighting a gap in practical coding capabilities.

Dystruct: Dynamically Structured Diffusion Language Model Decoding via Bayesian Inference

cs.LG · 2026-05-10 · unverdicted · novelty 7.0

Dystruct formulates flexible-length generation in diffusion language models as a dynamic structural inference problem solved via Bayesian integration of local uncertainty and structural signals.

MAS-Algorithm: A Workflow for Solving Algorithmic Programming Problems with a Multi-Agent System

cs.AI · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

MAS-Algorithm is a multi-agent workflow that improves AI acceptance rates on algorithmic problems by 6.48% on average, outperforming parameter-efficient fine-tuning.

AI Alignment via Incentives and Correction

cs.LG · 2026-05-02 · unverdicted · novelty 6.0 · 2 refs

AI alignment is reframed as a fixed-point incentive problem in a solver-auditor pipeline, solved via bilevel optimization and bandit search over reward profiles to maintain monitoring and reduce hallucinations in LLM coding tasks.

MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework

cs.AI · 2023-08-01 · unverdicted · novelty 6.0

MetaGPT embeds human SOPs into LLM prompts to create role-specialized agent teams that produce more coherent solutions on collaborative software engineering tasks than prior chat-based multi-agent systems.

CodeWiki: Evaluating AI's Ability to Generate Holistic Documentation for Large-Scale Codebases

cs.SE · 2025-10-28 · unverdicted · novelty 5.0

CodeWiki presents a unified framework for repository-level documentation across seven languages using hierarchical decomposition, recursive multi-agent processing, and multi-modal synthesis, outperforming DeepWiki by 4.73% on CodeWikiBench.

Retrieval-Augmented Code Generation: A Survey with Focus on Repository-Level Approaches

cs.SE · 2025-10-06 · unverdicted · novelty 4.0

The paper organizes repository-level retrieval-augmented code generation into a unified framework covering retrieval substrate, control regime, and evaluation setting while summarizing strategies, datasets, and challenges.

citing papers explorer

SWE-bench: Can Language Models Resolve Real-World GitHub Issues? · cs.CL · 2023-10-10 · unverdicted · none · ref 80
Dystruct: Dynamically Structured Diffusion Language Model Decoding via Bayesian Inference · cs.LG · 2026-05-10 · unverdicted · none · ref 45
MAS-Algorithm: A Workflow for Solving Algorithmic Programming Problems with a Multi-Agent System · cs.AI · 2026-05-07 · unverdicted · none · ref 3 · 2 links
AI Alignment via Incentives and Correction · cs.LG · 2026-05-02 · unverdicted · none · ref 5 · 2 links
MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework · cs.AI · 2023-08-01 · unverdicted · none · ref 2
CodeWiki: Evaluating AI's Ability to Generate Holistic Documentation for Large-Scale Codebases · cs.SE · 2025-10-28 · unverdicted · none · ref 2
Retrieval-Augmented Code Generation: A Survey with Focus on Repository-Level Approaches · cs.SE · 2025-10-06 · unverdicted · none · ref 146
