Title resolution pending

Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, brian ichter, Fei Xia, Ed Chi, Quoc V Le, Denny Zhou · 2022

4 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver will retry the DOI and OpenAlex lookups on its next pass.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

TraceFix: Repairing Agent Coordination Protocols with TLA+ Counterexamples

cs.AI · 2026-05-08 · conditional · novelty 7.0

TraceFix repairs LLM-generated multi-agent protocols via TLA+ counterexamples to achieve full verification on all tested tasks and higher completion rates than prompt-only baselines.

When More Reformulations Hurt: Avoiding Drift using Ranker Feedback

cs.IR · 2026-05-01 · unverdicted · novelty 7.0

ReformIR adaptively prioritizes reformulations and documents with a surrogate model guided by ranker feedback to boost recall while suppressing drift under fixed reranking budgets.

RubberDuckBench: A Benchmark for AI Coding Assistants

cs.SE · 2026-01-23 · unverdicted · novelty 7.0

RubberDuckBench shows top AI models score around 68% on real GitHub coding questions, rarely answer completely correctly, and hallucinate in 58% of responses on average.

Reproducing Adaptive Reranking for Reasoning-Intensive IR

cs.IR · 2026-04-30 · unverdicted · novelty 2.0

Reproducing GAR on BRIGHT shows it boosts reasoning-intensive retrieval effectiveness with low overhead when the reranker's signal quality is strong.

citing papers explorer

Showing 4 of 4 citing papers.

TraceFix: Repairing Agent Coordination Protocols with TLA+ Counterexamples
cs.AI · 2026-05-08 · conditional · none · ref 38
TraceFix repairs LLM-generated multi-agent protocols via TLA+ counterexamples to achieve full verification on all tested tasks and higher completion rates than prompt-only baselines.

When More Reformulations Hurt: Avoiding Drift using Ranker Feedback
cs.IR · 2026-05-01 · unverdicted · none · ref 45
ReformIR adaptively prioritizes reformulations and documents with a surrogate model guided by ranker feedback to boost recall while suppressing drift under fixed reranking budgets.

RubberDuckBench: A Benchmark for AI Coding Assistants
cs.SE · 2026-01-23 · unverdicted · none · ref 36
RubberDuckBench shows top AI models score around 68% on real GitHub coding questions, rarely answer completely correctly, and hallucinate in 58% of responses on average.

Reproducing Adaptive Reranking for Reasoning-Intensive IR
cs.IR · 2026-04-30 · unverdicted · none · ref 44
Reproducing GAR on BRIGHT shows it boosts reasoning-intensive retrieval effectiveness with low overhead when the reranker's signal quality is strong.
