//arxiv.org/abs/2304.07590, 2304.07590

Dong Y, Jiang X, Jin Z, Li G ( · 2024 · arXiv 2304.07590

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Social Bias in LLM-Generated Code: Benchmark and Mitigation

cs.SE · 2026-05-01 · unverdicted · novelty 7.0

LLMs show up to 60.58% social bias in generated code; a new Fairness Monitor Agent cuts bias by 65.1% and raises functional correctness from 75.80% to 83.97%.

How Many Tries Does It Take? Iterative Self-Repair in LLM Code Generation Across Model Scales and Benchmarks

cs.SE · 2026-04-12 · unverdicted · novelty 4.0

Iterative self-repair improves LLM code pass rates by 4.9-17.1 pp on HumanEval and 16-30 pp on MBPP across seven models, with gains concentrated early and syntax errors easier to fix than logical ones.

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

cs.AI · 2023-08-16

citing papers explorer

Showing 3 of 3 citing papers.

Social Bias in LLM-Generated Code: Benchmark and Mitigation cs.SE · 2026-05-01 · unverdicted · none · ref 89
LLMs show up to 60.58% social bias in generated code; a new Fairness Monitor Agent cuts bias by 65.1% and raises functional correctness from 75.80% to 83.97%.
How Many Tries Does It Take? Iterative Self-Repair in LLM Code Generation Across Model Scales and Benchmarks cs.SE · 2026-04-12 · unverdicted · none · ref 23
Iterative self-repair improves LLM code pass rates by 4.9-17.1 pp on HumanEval and 16-30 pp on MBPP across seven models, with gains concentrated early and syntax errors easier to fix than logical ones.
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation cs.AI · 2023-08-16 · unreviewed · ref 1

//arxiv.org/abs/2304.07590, 2304.07590

fields

years

verdicts

representative citing papers

citing papers explorer