Only Ask What You Don't Know: Grounded Delta Planning for Efficient Multi-step RAG

Claire Lin; Hung-yi Lee; Jian-Ren Lin; Jyh-Shing Roger Jang; Wei-Chieh Chou; Xuanjun Chen

arxiv: 2606.22681 · v1 · pith:6NXN6FQRnew · submitted 2026-06-21 · 💻 cs.CL · cs.AI

Only Ask What You Don't Know: Grounded Delta Planning for Efficient Multi-step RAG

Wei-Chieh Chou , Xuanjun Chen , Jian-Ren Lin , Claire Lin , Hung-yi Lee , Jyh-Shing Roger Jang This is my paper

Pith reviewed 2026-06-26 10:08 UTC · model grok-4.3

classification 💻 cs.CL cs.AI

keywords multi-hop question answeringretrieval-augmented generationplanningdelta planningefficient RAGgap-conditioned promptsHotpotQAMuSiQue

0 comments

The pith

GDP-RAG improves multi-hop RAG by grounding plans in a preliminary retrieval pass and then retrieving only the information gaps, reaching 60.63% accuracy at lower cost than prior methods.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents Grounded Delta Planning RAG (GDP-RAG) as a way to reduce error propagation and wasted steps in iterative retrieval-augmented generation for multi-hop questions. It begins with a preliminary retrieval to ground the plan, then uses a prompt that explicitly asks the model to name only the missing information needed. A skeletal trajectory carries evidence from that first pass forward through each subquery. On HotpotQA, 2WikiMultiHopQA, and MuSiQue this produces the highest accuracy among compared systems together with a cost-of-pass 22% below PAR-RAG and 68% below KnowTrace.

Core claim

GDP-RAG is a plan-based framework that targets only the information delta based on three design choices: preliminary retrieval to ground planning before execution, a gap-conditioned planning prompt that asks only for missing information, and a skeletal trajectory that pairs each subquery with a Thought capturing evidence from preliminary retrieval and carrying it through to the final answer. The method focuses computation on unresolved gaps and yields concise, reliable reasoning trajectories.

What carries the argument

Grounded Delta Planning, which combines a preliminary retrieval pass with a gap-conditioned prompt that identifies only missing information before any further retrieval occurs.

If this is right

Multi-hop QA can be performed with fewer retrieval rounds while preserving or increasing answer accuracy.
Error accumulation across iterative retrieval is reduced when plans are conditioned on already-retrieved evidence.
Over-generation of unnecessary reasoning steps is avoided by restricting the planner to explicit information gaps.
No compared method simultaneously exceeded GDP-RAG accuracy and undercut its cost-of-pass.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same preliminary-grounding pattern could be tested on iterative tasks outside question answering, such as multi-step code generation or tool use.
If the first retrieval pass is weak in a new domain, adding a second cheap initial pass might be needed before planning begins.
The skeletal trajectory structure may also limit hallucination by forcing every subquery to reference already-grounded evidence.

Load-bearing premise

The preliminary retrieval pass reliably surfaces enough evidence for the planner to identify true missing information without missing critical context that would appear only in later rounds.

What would settle it

A controlled test set in which the initial retrieval consistently omits at least one fact required to answer the question, with measurement of whether GDP-RAG accuracy then drops below the strongest baseline.

Figures

Figures reproduced from arXiv: 2606.22681 by Claire Lin, Hung-yi Lee, Jian-Ren Lin, Jyh-Shing Roger Jang, Wei-Chieh Chou, Xuanjun Chen.

**Figure 2.** Figure 2: GDP-RAG framework. The workflow proceeds in three phases: (1) [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 3.** Figure 3: Experimental results of the proposed GDPRAG framework. (a) shows the trade-off [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: Comparison of planning statistics (a) and reasoning complexity by query hops (b). [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗

**Figure 5.** Figure 5: Prompt of Direct Planning [PITH_FULL_IMAGE:figures/full_fig_p015_5.png] view at source ↗

**Figure 6.** Figure 6: Prompt of Grounded Delta Planning. Direct Planning Prompt As shown in [PITH_FULL_IMAGE:figures/full_fig_p015_6.png] view at source ↗

read the original abstract

Multi-hop question answering remains challenging for Retrieval-Augmented Generation (RAG) because existing approaches either propagate errors across iterative retrieval rounds or over-generate reasoning steps, increasing cost without improving accuracy. We propose Grounded Delta Planning RAG (GDP-RAG), a plan-based framework that targets only the information delta based on three simple design choices: (1) preliminary retrieval to ground planning before execution, (2) a gap-conditioned planning prompt that asks only for missing information, and (3) a skeletal trajectory that pairs each subquery with a Thought capturing evidence from preliminary retrieval and carrying it through to the final answer. GDP-RAG focuses computation on unresolved gaps, yielding concise, reliable reasoning trajectories. Extensive experiments on HotpotQA, 2WikiMultiHopQA, and MuSiQue show that GDP-RAG achieves the highest accuracy (60.63%) among all compared systems while maintaining a cost-of-pass of 0.51, 22% lower than PAR-RAG (0.65) and 68% lower than KnowTrace (1.57), with no method achieving both higher accuracy and lower cost.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

GDP-RAG is a clean incremental design for pruning extra retrieval rounds in multi-hop QA, but the headline accuracy-cost numbers rest on an assumption about the preliminary pass that the abstract leaves unexamined.

read the letter

The new piece is the specific combination of a first retrieval to ground the plan, a prompt that asks only for missing information, and a skeletal trajectory that carries the initial evidence forward. That pattern is not in the PAR-RAG or KnowTrace baselines they cite, and it directly targets the error-propagation and over-generation problems they name.

The work does what it sets out to do on paper: it keeps trajectories short by conditioning the planner on gaps and reusing the preliminary evidence. The reported 60.63% accuracy at 0.51 cost-of-pass beats the compared systems on the joint metric, which is the practical point.

The soft spot is the one the stress-test flags. The cost advantage comes from avoiding later rounds, yet that only works if the single preliminary pass already surfaces enough context to let the gap planner see every real missing piece. On HotpotQA-style items the supporting facts are often not co-retrieved first time, so the planner could under-specify and the carried Thoughts would stay incomplete. The abstract gives no ablation on that assumption, no failure cases, and no controls on dataset splits or variance, so the numbers are hard to trust at face value.

This is for people who implement multi-hop RAG pipelines and want lower per-query cost. It is worth a referee because the design is explicit and the efficiency claim is testable, even if the current evidence is thin.

Referee Report

2 major / 1 minor

Summary. The paper proposes Grounded Delta Planning RAG (GDP-RAG), a plan-based framework for multi-hop QA that relies on three design choices: (1) a preliminary retrieval pass to ground planning, (2) a gap-conditioned prompt that requests only missing information, and (3) a skeletal trajectory pairing subqueries with Thoughts from the preliminary pass. Experiments on HotpotQA, 2WikiMultiHopQA, and MuSiQue claim GDP-RAG attains the highest accuracy (60.63%) at the lowest cost-of-pass (0.51), outperforming baselines such as PAR-RAG (0.65) and KnowTrace (1.57) with no method dominating on both metrics.

Significance. If the experimental claims hold under rigorous controls, the work would be significant for demonstrating that targeted delta planning grounded in an initial retrieval can simultaneously improve accuracy and reduce retrieval cost in multi-step RAG, a practical advance over iterative or over-generating baselines.

major comments (2)

[Experiments] Experiments section: the headline accuracy (60.63%) and cost-of-pass (0.51) figures are reported without error bars, dataset split details, number of runs, ablation studies, or statistical significance tests. Because the central claim that GDP-RAG dominates the accuracy-cost frontier rests entirely on these numbers, the absence of these controls makes the result difficult to evaluate.
[Method] Method, design choice (1): the framework assumes a single preliminary retrieval pass will surface sufficient evidence for the gap-conditioned planner to identify all true deltas on multi-hop items. No analysis or failure-case breakdown is provided for HotpotQA-style questions where supporting facts are not co-retrieved in the first round; this assumption is load-bearing for both the accuracy and the reported cost advantage.

minor comments (1)

The abstract and method description would benefit from explicit notation for the skeletal trajectory and cost-of-pass metric to improve reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the thoughtful comments, which highlight important areas for strengthening the experimental rigor and methodological analysis. We address each point below and will incorporate revisions to address the concerns.

read point-by-point responses

Referee: [Experiments] Experiments section: the headline accuracy (60.63%) and cost-of-pass (0.51) figures are reported without error bars, dataset split details, number of runs, ablation studies, or statistical significance tests. Because the central claim that GDP-RAG dominates the accuracy-cost frontier rests entirely on these numbers, the absence of these controls makes the result difficult to evaluate.

Authors: We agree that the current presentation of results would benefit from additional statistical controls. In the revised manuscript we will report results aggregated over multiple runs (with means and standard deviations), explicitly state the dataset splits employed, include ablation studies isolating each of the three design choices, and add statistical significance tests comparing GDP-RAG against the strongest baselines. revision: yes
Referee: [Method] Method, design choice (1): the framework assumes a single preliminary retrieval pass will surface sufficient evidence for the gap-conditioned planner to identify all true deltas on multi-hop items. No analysis or failure-case breakdown is provided for HotpotQA-style questions where supporting facts are not co-retrieved in the first round; this assumption is load-bearing for both the accuracy and the reported cost advantage.

Authors: The preliminary retrieval step is intended to provide grounding for subsequent planning, and the overall empirical results support its utility. We acknowledge that a dedicated failure-case analysis for instances where supporting facts are not co-retrieved would strengthen the paper. In the revision we will add such an analysis, including quantitative breakdown of retrieval coverage on multi-hop items and qualitative examination of how the gap-conditioned prompt behaves when the initial pass is incomplete. revision: yes

Circularity Check

0 steps flagged

No circularity; performance metrics are direct experimental outcomes with no derivation chain.

full rationale

The paper describes a RAG framework via three explicit design choices and reports empirical accuracy (60.63%) and cost-of-pass (0.51) on HotpotQA, 2WikiMultiHopQA, and MuSiQue. No equations, fitted parameters, self-citations, or uniqueness theorems are invoked to derive results. The reported figures are presented as measured experimental outcomes, not quantities that reduce to the inputs by construction. The preliminary-retrieval assumption is a methodological choice whose sufficiency is evaluated empirically rather than presupposed in a tautological manner.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities; all technical detail is deferred to the unavailable full text.

pith-pipeline@v0.9.1-grok · 5748 in / 1093 out tokens · 21040 ms · 2026-06-26T10:08:20.212349+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

105 extracted references · 25 canonical work pages · 7 internal anchors

[1]

A Preliminary Study of

Lin, Claire and Feng, Bo-Han and Chen, Xuanjun and Yang, Te-Lun and Lee, Hung-yi and Jang, Jyh-Shing Roger , journal=. A Preliminary Study of
[2]

arXiv preprint arXiv:2212.10509 , year =

Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions , author =. arXiv preprint arXiv:2212.10509 , year =

Pith/arXiv arXiv
[3]

arXiv preprint arXiv:2210.03629 , year =

ReAct: Synergizing Reasoning and Acting in Language Models , author =. arXiv preprint arXiv:2210.03629 , year =

Pith/arXiv arXiv
[4]

arXiv preprint arXiv:2501.14342 , year =

Chain-of-Retrieval Augmented Generation , author =. arXiv preprint arXiv:2501.14342 , year =

arXiv
[5]

arXiv preprint arXiv:2310.11511 , year =

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection , author =. arXiv preprint arXiv:2310.11511 , year =

Pith/arXiv arXiv
[7]

Findings of the Association for Computational Linguistics: EMNLP 2024 , year =

Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs , author =. Findings of the Association for Computational Linguistics: EMNLP 2024 , year =

2024
[8]

arXiv preprint arXiv:2410.20753 , year =

Plan*RAG: Efficient Test-Time Planning for Retrieval Augmented Generation , author =. arXiv preprint arXiv:2410.20753 , year =

arXiv
[9]

arXiv preprint arXiv:2406.12430 , year =

PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers , author =. arXiv preprint arXiv:2406.12430 , year =. doi:10.48550/arXiv.2406.12430 , url =

work page doi:10.48550/arxiv.2406.12430
[11]

Thirty-seventh Conference on Neural Information Processing Systems , year=

Reflexion: language agents with verbal reinforcement learning , author=. Thirty-seventh Conference on Neural Information Processing Systems , year=
[12]

Advances in Neural Information Processing Systems (NeurIPS) , year =

HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models , author =. Advances in Neural Information Processing Systems (NeurIPS) , year =
[13]

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models , author =. arXiv preprint arXiv:2502.14802 , year =. doi:10.48550/arXiv.2502.14802 , url =

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2502.14802
[14]

Advances in Neural Information Processing Systems , volume =

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks , author =. Advances in Neural Information Processing Systems , volume =. 2020 , publisher =

2020
[15]

, booktitle =

Yang, Zhilin and Qi, Peng and Zhang, Saizheng and Bengio, Yoshua and Cohen, William and Salakhutdinov, Ruslan and Manning, Christopher D. , booktitle =. 2018 , pages =

2018
[16]

Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020) , month = dec, year = 2020, address =

Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps , author =. Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020) , month = dec, year = 2020, address =

2020
[17]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL) , url =

MuSiQue: Multi-hop Questions via Single-hop Question Composition , author =. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL) , url =. 2022 , pages =

2022
[18]

2024 , eprint=

M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation , author=. 2024 , eprint=

2024
[19]

Chen, Jianlv and Xiao, Shitao and Zhang, Peitian and Luo, Kun and Lian, Defu and Liu, Zheng , year =
[20]

2024 , howpublished =

OpenAI , title =. 2024 , howpublished =

2024
[21]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pages =

ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction , author =. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pages =. 2022 , address =

2022
[22]

NeurIPS , year =

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models , author =. NeurIPS , year =
[23]

NeurIPS , year =

Large Language Models are Zero-Shot Reasoners , author =. NeurIPS , year =
[24]

NAACL , year =

KILT: a Benchmark for Knowledge Intensive Language Tasks , author =. NAACL , year =
[25]

NAACL , year =

The Web as a Knowledge-Base for Answering Complex Questions , author =. NAACL , year =
[26]

TACL , year =

Constructing Datasets for Multi-hop Reading Comprehension Across Documents , author =. TACL , year =
[27]

arXiv preprint , volume =

Retrieval-Augmented Generation for Large Language Models: A Survey , author =. arXiv preprint , volume =. 2023 , url =

2023
[28]

arXiv preprint arXiv:2501.09136 , year =

Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG , author =. arXiv preprint arXiv:2501.09136 , year =

Pith/arXiv arXiv
[29]

Citation-Enhanced Generation for

Li, Weitao and Li, Junkai and Ma, Weizhi and Liu, Yang , year =. Citation-Enhanced Generation for
[30]

Advances in Neural Information Processing Systems , volume =

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models , author =. Advances in Neural Information Processing Systems , volume =. 2022 , publisher =. doi:10.5555/3600270.3602070 , url =

work page doi:10.5555/3600270.3602070 2022
[31]

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy , author =. Findings of the Association for Computational Linguistics: EMNLP 2023 , year =. doi:10.18653/v1/2023.findings-emnlp.620 , url =

work page doi:10.18653/v1/2023.findings-emnlp.620 2023
[32]

Transactions of the Association for Computational Linguistics , volume=

Lost in the Middle: How Language Models Use Long Contexts , author=. Transactions of the Association for Computational Linguistics , volume=
[33]

Thirty-seventh Conference on Neural Information Processing Systems , year=

Faith and Fate: Limits of Transformers on Compositionality , author=. Thirty-seventh Conference on Neural Information Processing Systems , year=
[34]

Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =

Towards Mitigating Hallucination in Large Language Models via Self Reflection , author =. Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =. 2023 , address =

2023
[35]

NeurIPS , url=

ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models , author=. NeurIPS , url=
[36]

arXiv preprint arXiv:2309.11495 , url=

Chain-of-Verification Reduces Hallucination in Large Language Models , author=. arXiv preprint arXiv:2309.11495 , url=

Pith/arXiv arXiv
[37]

2024 , archivePrefix=

Knowledge-Aware Query Expansion with Large Language Models for Textual and Relational Retrieval , author =. 2024 , archivePrefix=. 2410.13765 , primaryClass =

arXiv 2024
[38]

Proceedings of the Twelfth International Conference on Learning Representations , year =

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection , author =. Proceedings of the Twelfth International Conference on Learning Representations , year =
[39]

2024 , archivePrefix=

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters , author =. 2024 , archivePrefix=. 2408.03314 , primaryClass =

Pith/arXiv arXiv 2024
[40]

Query Rewriting in Retrieval-Augmented Large Language Models

Query Rewriting for Retrieval-Augmented Large Language Models , author =. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing , pages =. 2023 , address =. doi:10.18653/v1/2023.emnlp-main.322 , url =

work page doi:10.18653/v1/2023.emnlp-main.322 2023
[41]

Precise Zero-Shot Dense Retrieval without Relevance Labels , booktitle =

Precise Zero-Shot Dense Retrieval without Relevance Labels , author =. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , year =. doi:10.18653/v1/2023.acl-long.99 , url =

work page doi:10.18653/v1/2023.acl-long.99 2023
[42]

Query2doc: Query Expansion with Large Language Models

Query2doc: Query Expansion with Large Language Models , author =. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing , pages =. 2023 , address =. doi:10.18653/v1/2023.emnlp-main.585 , url =

work page doi:10.18653/v1/2023.emnlp-main.585 2023
[43]

2024 , archivePrefix=

From Local to Global: A Graph RAG Approach to Query-Focused Summarization , author =. 2024 , archivePrefix=. 2404.16130 , primaryClass =

Pith/arXiv arXiv 2024
[44]

2024 , archivePrefix=

Corrective Retrieval Augmented Generation , author =. 2024 , archivePrefix=. 2401.15884 , primaryClass =

Pith/arXiv arXiv 2024
[45]

Measuring and Narrowing the Compositionality Gap in Language Models

Measuring and Narrowing the Compositionality Gap in Language Models , author =. Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =. 2023 , address =. doi:10.18653/v1/2023.findings-emnlp.378 , url =

work page doi:10.18653/v1/2023.findings-emnlp.378 2023
[47]

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) , year=

KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing , author=. Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) , year=. doi:10.48550/arXiv.2505.20245 , url=

work page doi:10.48550/arxiv.2505.20245
[48]

arXiv preprint arXiv:2408.10490 , year=

Plan-based Retrieval-Augmented Generation: Scaling Multi-hop Reasoning with Large Language Models , author=. arXiv preprint arXiv:2408.10490 , year=. 2408.10490 , archivePrefix=

arXiv
[49]

arXiv preprint arXiv:2511.07445 , year =

Lin, Claire and Feng, Bo-Han and Chen, Xuanjun and Yang, Te-Lun and Lee, Hung-yi and Jang, Jyh-Shing Roger , title =. arXiv preprint arXiv:2511.07445 , year =

arXiv
[50]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =

Trivedi, Harsh and Balasubramanian, Niranjan and Khot, Tushar and Sabharwal, Ashish , title =. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =. 2023 , doi =

2023
[51]

and Cao, Yuan , title =

Yao, Shunyu and Zhao, Jeffrey and Yu, Dian and Du, Nan and Shafran, Izhak and Narasimhan, Karthik R. and Cao, Yuan , title =. The Eleventh International Conference on Learning Representations (ICLR 2023) , year =

2023
[52]

arXiv preprint arXiv:2501.14342 , year =

Wang, Liang and Chen, Haonan and Yang, Nan and Huang, Xiaolong and Dou, Zhicheng and Wei, Furu , title =. arXiv preprint arXiv:2501.14342 , year =. doi:10.48550/arXiv.2501.14342 , url =

work page doi:10.48550/arxiv.2501.14342
[53]

arXiv preprint arXiv:2504.16787 , year =

Zhang, Ningning and Zhang, Chi and Tan, Zhizhong and Yang, Xingxing and Deng, Weiping and Wang, Wenyong , title =. arXiv preprint arXiv:2504.16787 , year =. doi:10.48550/arXiv.2504.16787 , url =

work page doi:10.48550/arxiv.2504.16787
[54]

and Zhang, Wen and Chen, Huajun

Wang, Junjie and Chen, Mingyang and Hu, Binbin and Yang, Dan and Liu, Ziqi and Shen, Yue and Wei, Peng and Zhang, Zhiqiang and Gu, Jinjie and Zhou, Jun and Pan, Jeff Z. and Zhang, Wen and Chen, Huajun , title =. Findings of the Association for Computational Linguistics: EMNLP 2024 , year =. doi:10.18653/v1/2024.findings-emnlp.459 , url =

work page doi:10.18653/v1/2024.findings-emnlp.459 2024
[55]

arXiv preprint arXiv:2410.20753 , year =

Verma, Prakhar and Midigeshi, Sukruta Prakash and Sinha, Gaurav and Solin, Arno and Natarajan, Nagarajan and Sharma, Amit , title =. arXiv preprint arXiv:2410.20753 , year =. doi:10.48550/arXiv.2410.20753 , url =

work page doi:10.48550/arxiv.2410.20753
[56]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: NAACL 2024 , pages =

Lee, Myeonghwa and An, Seonho and Kim, Min-Soo , title =. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: NAACL 2024 , pages =. 2024 , doi =

2024
[57]

and Lewis, Mike , title =

Press, Ofir and Zhang, Muru and Min, Sewon and Schmidt, Ludwig and Smith, Noah A. and Lewis, Mike , title =. Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =. 2023 , doi =

2023
[58]

Advances in Neural Information Processing Systems (NeurIPS 2023) , year =

Shinn, Noah and Cassano, Federico and Gopinath, Ashwin and Narasimhan, Karthik and Yao, Shunyu , title =. Advances in Neural Information Processing Systems (NeurIPS 2023) , year =

2023
[59]

Advances in Neural Information Processing Systems (NeurIPS 2024) , year =

Gutierrez, Bernal Jimenez and Shu, Yiheng and Gu, Yu and Yasunaga, Michihiro and Su, Yu , title =. Advances in Neural Information Processing Systems (NeurIPS 2024) , year =

2024
[60]

Gutierrez, Bernal Jim. From. Proceedings of the 42nd International Conference on Machine Learning (ICML 2025) , year =

2025
[61]

Retrieval-Augmented Generation for Knowledge-Intensive

Lewis, Patrick and Perez, Ethan and Piktus, Aleksandra and Petroni, Fabio and Karpukhin, Vladimir and Goyal, Naman and K. Retrieval-Augmented Generation for Knowledge-Intensive. Advances in Neural Information Processing Systems (NeurIPS 2020) , year =

2020
[62]

and Salakhutdinov, Ruslan and Manning, Christopher D

Yang, Zhilin and Qi, Peng and Zhang, Saizheng and Bengio, Yoshua and Cohen, William W. and Salakhutdinov, Ruslan and Manning, Christopher D. , title =. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018) , pages =. 2018 , doi =

2018
[63]

Proceedings of COLING 2020 , pages =

Xanh Ho and Anh-Khoa Duong Nguyen and Saku Sugawara and Akiko Aizawa , title =. Proceedings of COLING 2020 , pages =. 2020 , doi =

2020
[64]

Transactions of the Association for Computational Linguistics , volume =

Harsh Trivedi and Niranjan Balasubramanian and Tushar Khot and Ashish Sabharwal , title =. Transactions of the Association for Computational Linguistics , volume =. 2022 , doi =

2022
[65]

Findings of ACL 2024 , pages =

Jianlyu Chen and Shitao Xiao and Peitian Zhang and Kun Luo and Defu Lian and Zheng Liu , title =. Findings of ACL 2024 , pages =. 2024 , doi =

2024
[66]

2024 , url =

Chen, Jianlv and Xiao, Shitao and Zhang, Peitian and Luo, Kun and Lian, Defu and Liu, Zheng , title =. 2024 , url =

2024
[67]

2024 , url =

OpenAI , title =. 2024 , url =

2024
[68]

Proceedings of NAACL 2022 , pages =

Keshav Santhanam and Omar Khattab and Jon Saad-Falcon and Christopher Potts and Matei Zaharia , title =. Proceedings of NAACL 2022 , pages =. 2022 , doi =

2022
[69]

Chi and Quoc V

Jason Wei and Xuezhi Wang and Dale Schuurmans and Maarten Bosma and Brian Ichter and Fei Xia and Ed H. Chi and Quoc V. Le and Denny Zhou , title =. NeurIPS 2022 , year =

2022
[70]

NeurIPS 2022 , year =

Takeshi Kojima and Shixiang Shane Gu and Machel Reid and Yutaka Matsuo and Yusuke Iwasawa , title =. NeurIPS 2022 , year =

2022
[71]

NAACL 2021 , pages =

Fabio Petroni and Aleksandra Piktus and Angela Fan and Patrick Lewis and Majid Yazdani and Nicola De Cao and James Thorne and Yacine Jernite and Vladimir Karpukhin and Jean Maillard and Vassilis Plachouras and Tim Rockt. NAACL 2021 , pages =. 2021 , doi =

2021
[72]

NAACL 2018 , pages =

Alon Talmor and Jonathan Berant , title =. NAACL 2018 , pages =. 2018 , doi =

2018
[73]

TACL , volume =

Johannes Welbl and Pontus Stenetorp and Sebastian Riedel , title =. TACL , volume =. 2018 , doi =

2018
[74]

Retrieval-Augmented Generation for Large Language Models: A Survey

Yunfan Gao and Yun Xiong and Xinyu Gao and Kangxiang Jia and Jinliu Pan and Yuxi Bi and Yi Dai and Jiawei Sun and Qianyu Guo and Meng Wang and Haofen Wang , title =. arXiv preprint arXiv:2312.10997 , year =. doi:10.48550/arXiv.2312.10997 , url =

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2312.10997
[75]

Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG

Aditi Singh and Abul Ehtesham and Saket Kumar and Tala Talaei Khoei , title =. arXiv preprint arXiv:2501.09136 , year =. doi:10.48550/arXiv.2501.09136 , url =

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2501.09136
[76]

ACL 2024 , pages =

Weitao Li and Junkai Li and Weizhi Ma and Yang Liu , title =. ACL 2024 , pages =. 2024 , doi =

2024
[77]

Findings of EMNLP 2023 , pages =

Zhihong Shao and Yeyun Gong and Yelong Shen and Minlie Huang and Nan Duan and Weizhu Chen , title =. Findings of EMNLP 2023 , pages =. 2023 , doi =

2023
[78]

Liu and Kevin Lin and John Hewitt and Ashwin Paranjape and Michele Bevilacqua and Fabio Petroni and Percy Liang , title =

Nelson F. Liu and Kevin Lin and John Hewitt and Ashwin Paranjape and Michele Bevilacqua and Fabio Petroni and Percy Liang , title =. Transactions of the Association for Computational Linguistics , volume =. 2024 , doi =

2024
[79]

Hwang and Soumya Sanyal and Xiang Ren and Allyson Ettinger and Za

Nouha Dziri and Ximing Lu and Melanie Sclar and Xiang Lorraine Li and Liwei Jiang and Bill Yuchen Lin and Sean Welleck and Peter West and Chandra Bhagavatula and Ronan Le Bras and Jena D. Hwang and Soumya Sanyal and Xiang Ren and Allyson Ettinger and Za. Faith and Fate: Limits of Transformers on Compositionality , booktitle =. 2023 , url =

2023
[80]

Findings of EMNLP 2023 , pages =

Ziwei Ji and Tiezheng Yu and Yan Xu and Nayeon Lee and Etsuko Ishii and Pascale Fung , title =. Findings of EMNLP 2023 , pages =. 2023 , doi =

2023
[81]

ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models

Binfeng Xu and Zhiyuan Peng and Bowen Lei and Subhabrata Mukherjee and Yuchen Liu and Dongkuan Xu , title =. arXiv preprint arXiv:2305.18323 , year =. doi:10.48550/arXiv.2305.18323 , url =

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2305.18323
[82]

Findings of ACL 2024 , pages =

Shehzaad Dhuliawala and Mojtaba Komeili and Jing Xu and Roberta Raileanu and Xian Li and Asli Celikyilmaz and Jason Weston , title =. Findings of ACL 2024 , pages =. 2024 , doi =

2024
[83]

Rossi and Haoliang Wang and Julian J

Yu Xia and Junda Wu and Sungchul Kim and Tong Yu and Ryan A. Rossi and Haoliang Wang and Julian J. McAuley , title =. NAACL 2025 , pages =. 2025 , doi =

2025

Showing first 80 references.

[1] [1]

A Preliminary Study of

Lin, Claire and Feng, Bo-Han and Chen, Xuanjun and Yang, Te-Lun and Lee, Hung-yi and Jang, Jyh-Shing Roger , journal=. A Preliminary Study of

[2] [2]

arXiv preprint arXiv:2212.10509 , year =

Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions , author =. arXiv preprint arXiv:2212.10509 , year =

Pith/arXiv arXiv

[3] [3]

arXiv preprint arXiv:2210.03629 , year =

ReAct: Synergizing Reasoning and Acting in Language Models , author =. arXiv preprint arXiv:2210.03629 , year =

Pith/arXiv arXiv

[4] [4]

arXiv preprint arXiv:2501.14342 , year =

Chain-of-Retrieval Augmented Generation , author =. arXiv preprint arXiv:2501.14342 , year =

arXiv

[5] [5]

arXiv preprint arXiv:2310.11511 , year =

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection , author =. arXiv preprint arXiv:2310.11511 , year =

Pith/arXiv arXiv

[6] [7]

Findings of the Association for Computational Linguistics: EMNLP 2024 , year =

Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs , author =. Findings of the Association for Computational Linguistics: EMNLP 2024 , year =

2024

[7] [8]

arXiv preprint arXiv:2410.20753 , year =

Plan*RAG: Efficient Test-Time Planning for Retrieval Augmented Generation , author =. arXiv preprint arXiv:2410.20753 , year =

arXiv

[8] [9]

arXiv preprint arXiv:2406.12430 , year =

PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers , author =. arXiv preprint arXiv:2406.12430 , year =. doi:10.48550/arXiv.2406.12430 , url =

work page doi:10.48550/arxiv.2406.12430

[9] [11]

Thirty-seventh Conference on Neural Information Processing Systems , year=

Reflexion: language agents with verbal reinforcement learning , author=. Thirty-seventh Conference on Neural Information Processing Systems , year=

[10] [12]

Advances in Neural Information Processing Systems (NeurIPS) , year =

HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models , author =. Advances in Neural Information Processing Systems (NeurIPS) , year =

[11] [13]

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models , author =. arXiv preprint arXiv:2502.14802 , year =. doi:10.48550/arXiv.2502.14802 , url =

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2502.14802

[12] [14]

Advances in Neural Information Processing Systems , volume =

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks , author =. Advances in Neural Information Processing Systems , volume =. 2020 , publisher =

2020

[13] [15]

, booktitle =

Yang, Zhilin and Qi, Peng and Zhang, Saizheng and Bengio, Yoshua and Cohen, William and Salakhutdinov, Ruslan and Manning, Christopher D. , booktitle =. 2018 , pages =

2018

[14] [16]

Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020) , month = dec, year = 2020, address =

Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps , author =. Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020) , month = dec, year = 2020, address =

2020

[15] [17]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL) , url =

MuSiQue: Multi-hop Questions via Single-hop Question Composition , author =. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL) , url =. 2022 , pages =

2022

[16] [18]

2024 , eprint=

M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation , author=. 2024 , eprint=

2024

[17] [19]

Chen, Jianlv and Xiao, Shitao and Zhang, Peitian and Luo, Kun and Lian, Defu and Liu, Zheng , year =

[18] [20]

2024 , howpublished =

OpenAI , title =. 2024 , howpublished =

2024

[19] [21]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pages =

ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction , author =. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pages =. 2022 , address =

2022

[20] [22]

NeurIPS , year =

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models , author =. NeurIPS , year =

[21] [23]

NeurIPS , year =

Large Language Models are Zero-Shot Reasoners , author =. NeurIPS , year =

[22] [24]

NAACL , year =

KILT: a Benchmark for Knowledge Intensive Language Tasks , author =. NAACL , year =

[23] [25]

NAACL , year =

The Web as a Knowledge-Base for Answering Complex Questions , author =. NAACL , year =

[24] [26]

TACL , year =

Constructing Datasets for Multi-hop Reading Comprehension Across Documents , author =. TACL , year =

[25] [27]

arXiv preprint , volume =

Retrieval-Augmented Generation for Large Language Models: A Survey , author =. arXiv preprint , volume =. 2023 , url =

2023

[26] [28]

arXiv preprint arXiv:2501.09136 , year =

Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG , author =. arXiv preprint arXiv:2501.09136 , year =

Pith/arXiv arXiv

[27] [29]

Citation-Enhanced Generation for

Li, Weitao and Li, Junkai and Ma, Weizhi and Liu, Yang , year =. Citation-Enhanced Generation for

[28] [30]

Advances in Neural Information Processing Systems , volume =

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models , author =. Advances in Neural Information Processing Systems , volume =. 2022 , publisher =. doi:10.5555/3600270.3602070 , url =

work page doi:10.5555/3600270.3602070 2022

[29] [31]

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy , author =. Findings of the Association for Computational Linguistics: EMNLP 2023 , year =. doi:10.18653/v1/2023.findings-emnlp.620 , url =

work page doi:10.18653/v1/2023.findings-emnlp.620 2023

[30] [32]

Transactions of the Association for Computational Linguistics , volume=

Lost in the Middle: How Language Models Use Long Contexts , author=. Transactions of the Association for Computational Linguistics , volume=

[31] [33]

Thirty-seventh Conference on Neural Information Processing Systems , year=

Faith and Fate: Limits of Transformers on Compositionality , author=. Thirty-seventh Conference on Neural Information Processing Systems , year=

[32] [34]

Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =

Towards Mitigating Hallucination in Large Language Models via Self Reflection , author =. Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =. 2023 , address =

2023

[33] [35]

NeurIPS , url=

ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models , author=. NeurIPS , url=

[34] [36]

arXiv preprint arXiv:2309.11495 , url=

Chain-of-Verification Reduces Hallucination in Large Language Models , author=. arXiv preprint arXiv:2309.11495 , url=

Pith/arXiv arXiv

[35] [37]

2024 , archivePrefix=

Knowledge-Aware Query Expansion with Large Language Models for Textual and Relational Retrieval , author =. 2024 , archivePrefix=. 2410.13765 , primaryClass =

arXiv 2024

[36] [38]

Proceedings of the Twelfth International Conference on Learning Representations , year =

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection , author =. Proceedings of the Twelfth International Conference on Learning Representations , year =

[37] [39]

2024 , archivePrefix=

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters , author =. 2024 , archivePrefix=. 2408.03314 , primaryClass =

Pith/arXiv arXiv 2024

[38] [40]

Query Rewriting in Retrieval-Augmented Large Language Models

Query Rewriting for Retrieval-Augmented Large Language Models , author =. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing , pages =. 2023 , address =. doi:10.18653/v1/2023.emnlp-main.322 , url =

work page doi:10.18653/v1/2023.emnlp-main.322 2023

[39] [41]

Precise Zero-Shot Dense Retrieval without Relevance Labels , booktitle =

Precise Zero-Shot Dense Retrieval without Relevance Labels , author =. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , year =. doi:10.18653/v1/2023.acl-long.99 , url =

work page doi:10.18653/v1/2023.acl-long.99 2023

[40] [42]

Query2doc: Query Expansion with Large Language Models

Query2doc: Query Expansion with Large Language Models , author =. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing , pages =. 2023 , address =. doi:10.18653/v1/2023.emnlp-main.585 , url =

work page doi:10.18653/v1/2023.emnlp-main.585 2023

[41] [43]

2024 , archivePrefix=

From Local to Global: A Graph RAG Approach to Query-Focused Summarization , author =. 2024 , archivePrefix=. 2404.16130 , primaryClass =

Pith/arXiv arXiv 2024

[42] [44]

2024 , archivePrefix=

Corrective Retrieval Augmented Generation , author =. 2024 , archivePrefix=. 2401.15884 , primaryClass =

Pith/arXiv arXiv 2024

[43] [45]

Measuring and Narrowing the Compositionality Gap in Language Models

Measuring and Narrowing the Compositionality Gap in Language Models , author =. Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =. 2023 , address =. doi:10.18653/v1/2023.findings-emnlp.378 , url =

work page doi:10.18653/v1/2023.findings-emnlp.378 2023

[44] [47]

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) , year=

KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing , author=. Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) , year=. doi:10.48550/arXiv.2505.20245 , url=

work page doi:10.48550/arxiv.2505.20245

[45] [48]

arXiv preprint arXiv:2408.10490 , year=

Plan-based Retrieval-Augmented Generation: Scaling Multi-hop Reasoning with Large Language Models , author=. arXiv preprint arXiv:2408.10490 , year=. 2408.10490 , archivePrefix=

arXiv

[46] [49]

arXiv preprint arXiv:2511.07445 , year =

Lin, Claire and Feng, Bo-Han and Chen, Xuanjun and Yang, Te-Lun and Lee, Hung-yi and Jang, Jyh-Shing Roger , title =. arXiv preprint arXiv:2511.07445 , year =

arXiv

[47] [50]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =

Trivedi, Harsh and Balasubramanian, Niranjan and Khot, Tushar and Sabharwal, Ashish , title =. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =. 2023 , doi =

2023

[48] [51]

and Cao, Yuan , title =

Yao, Shunyu and Zhao, Jeffrey and Yu, Dian and Du, Nan and Shafran, Izhak and Narasimhan, Karthik R. and Cao, Yuan , title =. The Eleventh International Conference on Learning Representations (ICLR 2023) , year =

2023

[49] [52]

arXiv preprint arXiv:2501.14342 , year =

Wang, Liang and Chen, Haonan and Yang, Nan and Huang, Xiaolong and Dou, Zhicheng and Wei, Furu , title =. arXiv preprint arXiv:2501.14342 , year =. doi:10.48550/arXiv.2501.14342 , url =

work page doi:10.48550/arxiv.2501.14342

[50] [53]

arXiv preprint arXiv:2504.16787 , year =

Zhang, Ningning and Zhang, Chi and Tan, Zhizhong and Yang, Xingxing and Deng, Weiping and Wang, Wenyong , title =. arXiv preprint arXiv:2504.16787 , year =. doi:10.48550/arXiv.2504.16787 , url =

work page doi:10.48550/arxiv.2504.16787

[51] [54]

and Zhang, Wen and Chen, Huajun

Wang, Junjie and Chen, Mingyang and Hu, Binbin and Yang, Dan and Liu, Ziqi and Shen, Yue and Wei, Peng and Zhang, Zhiqiang and Gu, Jinjie and Zhou, Jun and Pan, Jeff Z. and Zhang, Wen and Chen, Huajun , title =. Findings of the Association for Computational Linguistics: EMNLP 2024 , year =. doi:10.18653/v1/2024.findings-emnlp.459 , url =

work page doi:10.18653/v1/2024.findings-emnlp.459 2024

[52] [55]

arXiv preprint arXiv:2410.20753 , year =

Verma, Prakhar and Midigeshi, Sukruta Prakash and Sinha, Gaurav and Solin, Arno and Natarajan, Nagarajan and Sharma, Amit , title =. arXiv preprint arXiv:2410.20753 , year =. doi:10.48550/arXiv.2410.20753 , url =

work page doi:10.48550/arxiv.2410.20753

[53] [56]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: NAACL 2024 , pages =

Lee, Myeonghwa and An, Seonho and Kim, Min-Soo , title =. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: NAACL 2024 , pages =. 2024 , doi =

2024

[54] [57]

and Lewis, Mike , title =

Press, Ofir and Zhang, Muru and Min, Sewon and Schmidt, Ludwig and Smith, Noah A. and Lewis, Mike , title =. Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =. 2023 , doi =

2023

[55] [58]

Advances in Neural Information Processing Systems (NeurIPS 2023) , year =

Shinn, Noah and Cassano, Federico and Gopinath, Ashwin and Narasimhan, Karthik and Yao, Shunyu , title =. Advances in Neural Information Processing Systems (NeurIPS 2023) , year =

2023

[56] [59]

Advances in Neural Information Processing Systems (NeurIPS 2024) , year =

Gutierrez, Bernal Jimenez and Shu, Yiheng and Gu, Yu and Yasunaga, Michihiro and Su, Yu , title =. Advances in Neural Information Processing Systems (NeurIPS 2024) , year =

2024

[57] [60]

Gutierrez, Bernal Jim. From. Proceedings of the 42nd International Conference on Machine Learning (ICML 2025) , year =

2025

[58] [61]

Retrieval-Augmented Generation for Knowledge-Intensive

Lewis, Patrick and Perez, Ethan and Piktus, Aleksandra and Petroni, Fabio and Karpukhin, Vladimir and Goyal, Naman and K. Retrieval-Augmented Generation for Knowledge-Intensive. Advances in Neural Information Processing Systems (NeurIPS 2020) , year =

2020

[59] [62]

and Salakhutdinov, Ruslan and Manning, Christopher D

Yang, Zhilin and Qi, Peng and Zhang, Saizheng and Bengio, Yoshua and Cohen, William W. and Salakhutdinov, Ruslan and Manning, Christopher D. , title =. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018) , pages =. 2018 , doi =

2018

[60] [63]

Proceedings of COLING 2020 , pages =

Xanh Ho and Anh-Khoa Duong Nguyen and Saku Sugawara and Akiko Aizawa , title =. Proceedings of COLING 2020 , pages =. 2020 , doi =

2020

[61] [64]

Transactions of the Association for Computational Linguistics , volume =

Harsh Trivedi and Niranjan Balasubramanian and Tushar Khot and Ashish Sabharwal , title =. Transactions of the Association for Computational Linguistics , volume =. 2022 , doi =

2022

[62] [65]

Findings of ACL 2024 , pages =

Jianlyu Chen and Shitao Xiao and Peitian Zhang and Kun Luo and Defu Lian and Zheng Liu , title =. Findings of ACL 2024 , pages =. 2024 , doi =

2024

[63] [66]

2024 , url =

Chen, Jianlv and Xiao, Shitao and Zhang, Peitian and Luo, Kun and Lian, Defu and Liu, Zheng , title =. 2024 , url =

2024

[64] [67]

2024 , url =

OpenAI , title =. 2024 , url =

2024

[65] [68]

Proceedings of NAACL 2022 , pages =

Keshav Santhanam and Omar Khattab and Jon Saad-Falcon and Christopher Potts and Matei Zaharia , title =. Proceedings of NAACL 2022 , pages =. 2022 , doi =

2022

[66] [69]

Chi and Quoc V

Jason Wei and Xuezhi Wang and Dale Schuurmans and Maarten Bosma and Brian Ichter and Fei Xia and Ed H. Chi and Quoc V. Le and Denny Zhou , title =. NeurIPS 2022 , year =

2022

[67] [70]

NeurIPS 2022 , year =

Takeshi Kojima and Shixiang Shane Gu and Machel Reid and Yutaka Matsuo and Yusuke Iwasawa , title =. NeurIPS 2022 , year =

2022

[68] [71]

NAACL 2021 , pages =

Fabio Petroni and Aleksandra Piktus and Angela Fan and Patrick Lewis and Majid Yazdani and Nicola De Cao and James Thorne and Yacine Jernite and Vladimir Karpukhin and Jean Maillard and Vassilis Plachouras and Tim Rockt. NAACL 2021 , pages =. 2021 , doi =

2021

[69] [72]

NAACL 2018 , pages =

Alon Talmor and Jonathan Berant , title =. NAACL 2018 , pages =. 2018 , doi =

2018

[70] [73]

TACL , volume =

Johannes Welbl and Pontus Stenetorp and Sebastian Riedel , title =. TACL , volume =. 2018 , doi =

2018

[71] [74]

Retrieval-Augmented Generation for Large Language Models: A Survey

Yunfan Gao and Yun Xiong and Xinyu Gao and Kangxiang Jia and Jinliu Pan and Yuxi Bi and Yi Dai and Jiawei Sun and Qianyu Guo and Meng Wang and Haofen Wang , title =. arXiv preprint arXiv:2312.10997 , year =. doi:10.48550/arXiv.2312.10997 , url =

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2312.10997

[72] [75]

Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG

Aditi Singh and Abul Ehtesham and Saket Kumar and Tala Talaei Khoei , title =. arXiv preprint arXiv:2501.09136 , year =. doi:10.48550/arXiv.2501.09136 , url =

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2501.09136

[73] [76]

ACL 2024 , pages =

Weitao Li and Junkai Li and Weizhi Ma and Yang Liu , title =. ACL 2024 , pages =. 2024 , doi =

2024

[74] [77]

Findings of EMNLP 2023 , pages =

Zhihong Shao and Yeyun Gong and Yelong Shen and Minlie Huang and Nan Duan and Weizhu Chen , title =. Findings of EMNLP 2023 , pages =. 2023 , doi =

2023

[75] [78]

Liu and Kevin Lin and John Hewitt and Ashwin Paranjape and Michele Bevilacqua and Fabio Petroni and Percy Liang , title =

Nelson F. Liu and Kevin Lin and John Hewitt and Ashwin Paranjape and Michele Bevilacqua and Fabio Petroni and Percy Liang , title =. Transactions of the Association for Computational Linguistics , volume =. 2024 , doi =

2024

[76] [79]

Hwang and Soumya Sanyal and Xiang Ren and Allyson Ettinger and Za

Nouha Dziri and Ximing Lu and Melanie Sclar and Xiang Lorraine Li and Liwei Jiang and Bill Yuchen Lin and Sean Welleck and Peter West and Chandra Bhagavatula and Ronan Le Bras and Jena D. Hwang and Soumya Sanyal and Xiang Ren and Allyson Ettinger and Za. Faith and Fate: Limits of Transformers on Compositionality , booktitle =. 2023 , url =

2023

[77] [80]

Findings of EMNLP 2023 , pages =

Ziwei Ji and Tiezheng Yu and Yan Xu and Nayeon Lee and Etsuko Ishii and Pascale Fung , title =. Findings of EMNLP 2023 , pages =. 2023 , doi =

2023

[78] [81]

ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models

Binfeng Xu and Zhiyuan Peng and Bowen Lei and Subhabrata Mukherjee and Yuchen Liu and Dongkuan Xu , title =. arXiv preprint arXiv:2305.18323 , year =. doi:10.48550/arXiv.2305.18323 , url =

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2305.18323

[79] [82]

Findings of ACL 2024 , pages =

Shehzaad Dhuliawala and Mojtaba Komeili and Jing Xu and Roberta Raileanu and Xian Li and Asli Celikyilmaz and Jason Weston , title =. Findings of ACL 2024 , pages =. 2024 , doi =

2024

[80] [83]

Rossi and Haoliang Wang and Julian J

Yu Xia and Junda Wu and Sungchul Kim and Tong Yu and Ryan A. Rossi and Haoliang Wang and Julian J. McAuley , title =. NAACL 2025 , pages =. 2025 , doi =

2025