Only Ask What You Don't Know: Grounded Delta Planning for Efficient Multi-step RAG
Pith reviewed 2026-06-26 10:08 UTC · model grok-4.3
The pith
GDP-RAG improves multi-hop RAG by grounding plans in a preliminary retrieval pass and then retrieving only the information gaps, reaching 60.63% accuracy at lower cost than prior methods.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
GDP-RAG is a plan-based framework that targets only the information delta based on three design choices: preliminary retrieval to ground planning before execution, a gap-conditioned planning prompt that asks only for missing information, and a skeletal trajectory that pairs each subquery with a Thought capturing evidence from preliminary retrieval and carrying it through to the final answer. The method focuses computation on unresolved gaps and yields concise, reliable reasoning trajectories.
What carries the argument
Grounded Delta Planning, which combines a preliminary retrieval pass with a gap-conditioned prompt that identifies only missing information before any further retrieval occurs.
If this is right
- Multi-hop QA can be performed with fewer retrieval rounds while preserving or increasing answer accuracy.
- Error accumulation across iterative retrieval is reduced when plans are conditioned on already-retrieved evidence.
- Over-generation of unnecessary reasoning steps is avoided by restricting the planner to explicit information gaps.
- No compared method simultaneously exceeded GDP-RAG accuracy and undercut its cost-of-pass.
Where Pith is reading between the lines
- The same preliminary-grounding pattern could be tested on iterative tasks outside question answering, such as multi-step code generation or tool use.
- If the first retrieval pass is weak in a new domain, adding a second cheap initial pass might be needed before planning begins.
- The skeletal trajectory structure may also limit hallucination by forcing every subquery to reference already-grounded evidence.
Load-bearing premise
The preliminary retrieval pass reliably surfaces enough evidence for the planner to identify true missing information without missing critical context that would appear only in later rounds.
What would settle it
A controlled test set in which the initial retrieval consistently omits at least one fact required to answer the question, with measurement of whether GDP-RAG accuracy then drops below the strongest baseline.
Figures
read the original abstract
Multi-hop question answering remains challenging for Retrieval-Augmented Generation (RAG) because existing approaches either propagate errors across iterative retrieval rounds or over-generate reasoning steps, increasing cost without improving accuracy. We propose Grounded Delta Planning RAG (GDP-RAG), a plan-based framework that targets only the information delta based on three simple design choices: (1) preliminary retrieval to ground planning before execution, (2) a gap-conditioned planning prompt that asks only for missing information, and (3) a skeletal trajectory that pairs each subquery with a Thought capturing evidence from preliminary retrieval and carrying it through to the final answer. GDP-RAG focuses computation on unresolved gaps, yielding concise, reliable reasoning trajectories. Extensive experiments on HotpotQA, 2WikiMultiHopQA, and MuSiQue show that GDP-RAG achieves the highest accuracy (60.63%) among all compared systems while maintaining a cost-of-pass of 0.51, 22% lower than PAR-RAG (0.65) and 68% lower than KnowTrace (1.57), with no method achieving both higher accuracy and lower cost.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes Grounded Delta Planning RAG (GDP-RAG), a plan-based framework for multi-hop QA that relies on three design choices: (1) a preliminary retrieval pass to ground planning, (2) a gap-conditioned prompt that requests only missing information, and (3) a skeletal trajectory pairing subqueries with Thoughts from the preliminary pass. Experiments on HotpotQA, 2WikiMultiHopQA, and MuSiQue claim GDP-RAG attains the highest accuracy (60.63%) at the lowest cost-of-pass (0.51), outperforming baselines such as PAR-RAG (0.65) and KnowTrace (1.57) with no method dominating on both metrics.
Significance. If the experimental claims hold under rigorous controls, the work would be significant for demonstrating that targeted delta planning grounded in an initial retrieval can simultaneously improve accuracy and reduce retrieval cost in multi-step RAG, a practical advance over iterative or over-generating baselines.
major comments (2)
- [Experiments] Experiments section: the headline accuracy (60.63%) and cost-of-pass (0.51) figures are reported without error bars, dataset split details, number of runs, ablation studies, or statistical significance tests. Because the central claim that GDP-RAG dominates the accuracy-cost frontier rests entirely on these numbers, the absence of these controls makes the result difficult to evaluate.
- [Method] Method, design choice (1): the framework assumes a single preliminary retrieval pass will surface sufficient evidence for the gap-conditioned planner to identify all true deltas on multi-hop items. No analysis or failure-case breakdown is provided for HotpotQA-style questions where supporting facts are not co-retrieved in the first round; this assumption is load-bearing for both the accuracy and the reported cost advantage.
minor comments (1)
- The abstract and method description would benefit from explicit notation for the skeletal trajectory and cost-of-pass metric to improve reproducibility.
Simulated Author's Rebuttal
We thank the referee for the thoughtful comments, which highlight important areas for strengthening the experimental rigor and methodological analysis. We address each point below and will incorporate revisions to address the concerns.
read point-by-point responses
-
Referee: [Experiments] Experiments section: the headline accuracy (60.63%) and cost-of-pass (0.51) figures are reported without error bars, dataset split details, number of runs, ablation studies, or statistical significance tests. Because the central claim that GDP-RAG dominates the accuracy-cost frontier rests entirely on these numbers, the absence of these controls makes the result difficult to evaluate.
Authors: We agree that the current presentation of results would benefit from additional statistical controls. In the revised manuscript we will report results aggregated over multiple runs (with means and standard deviations), explicitly state the dataset splits employed, include ablation studies isolating each of the three design choices, and add statistical significance tests comparing GDP-RAG against the strongest baselines. revision: yes
-
Referee: [Method] Method, design choice (1): the framework assumes a single preliminary retrieval pass will surface sufficient evidence for the gap-conditioned planner to identify all true deltas on multi-hop items. No analysis or failure-case breakdown is provided for HotpotQA-style questions where supporting facts are not co-retrieved in the first round; this assumption is load-bearing for both the accuracy and the reported cost advantage.
Authors: The preliminary retrieval step is intended to provide grounding for subsequent planning, and the overall empirical results support its utility. We acknowledge that a dedicated failure-case analysis for instances where supporting facts are not co-retrieved would strengthen the paper. In the revision we will add such an analysis, including quantitative breakdown of retrieval coverage on multi-hop items and qualitative examination of how the gap-conditioned prompt behaves when the initial pass is incomplete. revision: yes
Circularity Check
No circularity; performance metrics are direct experimental outcomes with no derivation chain.
full rationale
The paper describes a RAG framework via three explicit design choices and reports empirical accuracy (60.63%) and cost-of-pass (0.51) on HotpotQA, 2WikiMultiHopQA, and MuSiQue. No equations, fitted parameters, self-citations, or uniqueness theorems are invoked to derive results. The reported figures are presented as measured experimental outcomes, not quantities that reduce to the inputs by construction. The preliminary-retrieval assumption is a methodological choice whose sufficiency is evaluated empirically rather than presupposed in a tautological manner.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
A Preliminary Study of
Lin, Claire and Feng, Bo-Han and Chen, Xuanjun and Yang, Te-Lun and Lee, Hung-yi and Jang, Jyh-Shing Roger , journal=. A Preliminary Study of
-
[2]
arXiv preprint arXiv:2212.10509 , year =
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions , author =. arXiv preprint arXiv:2212.10509 , year =
-
[3]
arXiv preprint arXiv:2210.03629 , year =
ReAct: Synergizing Reasoning and Acting in Language Models , author =. arXiv preprint arXiv:2210.03629 , year =
-
[4]
arXiv preprint arXiv:2501.14342 , year =
Chain-of-Retrieval Augmented Generation , author =. arXiv preprint arXiv:2501.14342 , year =
-
[5]
arXiv preprint arXiv:2310.11511 , year =
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection , author =. arXiv preprint arXiv:2310.11511 , year =
-
[7]
Findings of the Association for Computational Linguistics: EMNLP 2024 , year =
Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs , author =. Findings of the Association for Computational Linguistics: EMNLP 2024 , year =
2024
-
[8]
arXiv preprint arXiv:2410.20753 , year =
Plan*RAG: Efficient Test-Time Planning for Retrieval Augmented Generation , author =. arXiv preprint arXiv:2410.20753 , year =
-
[9]
arXiv preprint arXiv:2406.12430 , year =
PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers , author =. arXiv preprint arXiv:2406.12430 , year =. doi:10.48550/arXiv.2406.12430 , url =
-
[11]
Thirty-seventh Conference on Neural Information Processing Systems , year=
Reflexion: language agents with verbal reinforcement learning , author=. Thirty-seventh Conference on Neural Information Processing Systems , year=
-
[12]
Advances in Neural Information Processing Systems (NeurIPS) , year =
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models , author =. Advances in Neural Information Processing Systems (NeurIPS) , year =
-
[13]
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models , author =. arXiv preprint arXiv:2502.14802 , year =. doi:10.48550/arXiv.2502.14802 , url =
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2502.14802
-
[14]
Advances in Neural Information Processing Systems , volume =
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks , author =. Advances in Neural Information Processing Systems , volume =. 2020 , publisher =
2020
-
[15]
, booktitle =
Yang, Zhilin and Qi, Peng and Zhang, Saizheng and Bengio, Yoshua and Cohen, William and Salakhutdinov, Ruslan and Manning, Christopher D. , booktitle =. 2018 , pages =
2018
-
[16]
Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020) , month = dec, year = 2020, address =
Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps , author =. Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020) , month = dec, year = 2020, address =
2020
-
[17]
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL) , url =
MuSiQue: Multi-hop Questions via Single-hop Question Composition , author =. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL) , url =. 2022 , pages =
2022
-
[18]
2024 , eprint=
M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation , author=. 2024 , eprint=
2024
-
[19]
Chen, Jianlv and Xiao, Shitao and Zhang, Peitian and Luo, Kun and Lian, Defu and Liu, Zheng , year =
-
[20]
2024 , howpublished =
OpenAI , title =. 2024 , howpublished =
2024
-
[21]
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pages =
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction , author =. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pages =. 2022 , address =
2022
-
[22]
NeurIPS , year =
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models , author =. NeurIPS , year =
-
[23]
NeurIPS , year =
Large Language Models are Zero-Shot Reasoners , author =. NeurIPS , year =
-
[24]
NAACL , year =
KILT: a Benchmark for Knowledge Intensive Language Tasks , author =. NAACL , year =
-
[25]
NAACL , year =
The Web as a Knowledge-Base for Answering Complex Questions , author =. NAACL , year =
-
[26]
TACL , year =
Constructing Datasets for Multi-hop Reading Comprehension Across Documents , author =. TACL , year =
-
[27]
arXiv preprint , volume =
Retrieval-Augmented Generation for Large Language Models: A Survey , author =. arXiv preprint , volume =. 2023 , url =
2023
-
[28]
arXiv preprint arXiv:2501.09136 , year =
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG , author =. arXiv preprint arXiv:2501.09136 , year =
-
[29]
Citation-Enhanced Generation for
Li, Weitao and Li, Junkai and Ma, Weizhi and Liu, Yang , year =. Citation-Enhanced Generation for
-
[30]
Advances in Neural Information Processing Systems , volume =
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models , author =. Advances in Neural Information Processing Systems , volume =. 2022 , publisher =. doi:10.5555/3600270.3602070 , url =
-
[31]
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy , author =. Findings of the Association for Computational Linguistics: EMNLP 2023 , year =. doi:10.18653/v1/2023.findings-emnlp.620 , url =
-
[32]
Transactions of the Association for Computational Linguistics , volume=
Lost in the Middle: How Language Models Use Long Contexts , author=. Transactions of the Association for Computational Linguistics , volume=
-
[33]
Thirty-seventh Conference on Neural Information Processing Systems , year=
Faith and Fate: Limits of Transformers on Compositionality , author=. Thirty-seventh Conference on Neural Information Processing Systems , year=
-
[34]
Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =
Towards Mitigating Hallucination in Large Language Models via Self Reflection , author =. Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =. 2023 , address =
2023
-
[35]
NeurIPS , url=
ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models , author=. NeurIPS , url=
-
[36]
arXiv preprint arXiv:2309.11495 , url=
Chain-of-Verification Reduces Hallucination in Large Language Models , author=. arXiv preprint arXiv:2309.11495 , url=
-
[37]
Knowledge-Aware Query Expansion with Large Language Models for Textual and Relational Retrieval , author =. 2024 , archivePrefix=. 2410.13765 , primaryClass =
arXiv 2024
-
[38]
Proceedings of the Twelfth International Conference on Learning Representations , year =
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection , author =. Proceedings of the Twelfth International Conference on Learning Representations , year =
-
[39]
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters , author =. 2024 , archivePrefix=. 2408.03314 , primaryClass =
Pith/arXiv arXiv 2024
-
[40]
Query Rewriting in Retrieval-Augmented Large Language Models
Query Rewriting for Retrieval-Augmented Large Language Models , author =. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing , pages =. 2023 , address =. doi:10.18653/v1/2023.emnlp-main.322 , url =
-
[41]
Precise Zero-Shot Dense Retrieval without Relevance Labels , booktitle =
Precise Zero-Shot Dense Retrieval without Relevance Labels , author =. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , year =. doi:10.18653/v1/2023.acl-long.99 , url =
-
[42]
Query2doc: Query Expansion with Large Language Models
Query2doc: Query Expansion with Large Language Models , author =. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing , pages =. 2023 , address =. doi:10.18653/v1/2023.emnlp-main.585 , url =
-
[43]
From Local to Global: A Graph RAG Approach to Query-Focused Summarization , author =. 2024 , archivePrefix=. 2404.16130 , primaryClass =
Pith/arXiv arXiv 2024
-
[44]
Corrective Retrieval Augmented Generation , author =. 2024 , archivePrefix=. 2401.15884 , primaryClass =
Pith/arXiv arXiv 2024
-
[45]
Measuring and Narrowing the Compositionality Gap in Language Models
Measuring and Narrowing the Compositionality Gap in Language Models , author =. Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =. 2023 , address =. doi:10.18653/v1/2023.findings-emnlp.378 , url =
-
[47]
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) , year=
KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing , author=. Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) , year=. doi:10.48550/arXiv.2505.20245 , url=
-
[48]
arXiv preprint arXiv:2408.10490 , year=
Plan-based Retrieval-Augmented Generation: Scaling Multi-hop Reasoning with Large Language Models , author=. arXiv preprint arXiv:2408.10490 , year=. 2408.10490 , archivePrefix=
-
[49]
arXiv preprint arXiv:2511.07445 , year =
Lin, Claire and Feng, Bo-Han and Chen, Xuanjun and Yang, Te-Lun and Lee, Hung-yi and Jang, Jyh-Shing Roger , title =. arXiv preprint arXiv:2511.07445 , year =
-
[50]
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =
Trivedi, Harsh and Balasubramanian, Niranjan and Khot, Tushar and Sabharwal, Ashish , title =. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =. 2023 , doi =
2023
-
[51]
and Cao, Yuan , title =
Yao, Shunyu and Zhao, Jeffrey and Yu, Dian and Du, Nan and Shafran, Izhak and Narasimhan, Karthik R. and Cao, Yuan , title =. The Eleventh International Conference on Learning Representations (ICLR 2023) , year =
2023
-
[52]
arXiv preprint arXiv:2501.14342 , year =
Wang, Liang and Chen, Haonan and Yang, Nan and Huang, Xiaolong and Dou, Zhicheng and Wei, Furu , title =. arXiv preprint arXiv:2501.14342 , year =. doi:10.48550/arXiv.2501.14342 , url =
-
[53]
arXiv preprint arXiv:2504.16787 , year =
Zhang, Ningning and Zhang, Chi and Tan, Zhizhong and Yang, Xingxing and Deng, Weiping and Wang, Wenyong , title =. arXiv preprint arXiv:2504.16787 , year =. doi:10.48550/arXiv.2504.16787 , url =
-
[54]
and Zhang, Wen and Chen, Huajun
Wang, Junjie and Chen, Mingyang and Hu, Binbin and Yang, Dan and Liu, Ziqi and Shen, Yue and Wei, Peng and Zhang, Zhiqiang and Gu, Jinjie and Zhou, Jun and Pan, Jeff Z. and Zhang, Wen and Chen, Huajun , title =. Findings of the Association for Computational Linguistics: EMNLP 2024 , year =. doi:10.18653/v1/2024.findings-emnlp.459 , url =
-
[55]
arXiv preprint arXiv:2410.20753 , year =
Verma, Prakhar and Midigeshi, Sukruta Prakash and Sinha, Gaurav and Solin, Arno and Natarajan, Nagarajan and Sharma, Amit , title =. arXiv preprint arXiv:2410.20753 , year =. doi:10.48550/arXiv.2410.20753 , url =
-
[56]
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: NAACL 2024 , pages =
Lee, Myeonghwa and An, Seonho and Kim, Min-Soo , title =. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: NAACL 2024 , pages =. 2024 , doi =
2024
-
[57]
and Lewis, Mike , title =
Press, Ofir and Zhang, Muru and Min, Sewon and Schmidt, Ludwig and Smith, Noah A. and Lewis, Mike , title =. Findings of the Association for Computational Linguistics: EMNLP 2023 , pages =. 2023 , doi =
2023
-
[58]
Advances in Neural Information Processing Systems (NeurIPS 2023) , year =
Shinn, Noah and Cassano, Federico and Gopinath, Ashwin and Narasimhan, Karthik and Yao, Shunyu , title =. Advances in Neural Information Processing Systems (NeurIPS 2023) , year =
2023
-
[59]
Advances in Neural Information Processing Systems (NeurIPS 2024) , year =
Gutierrez, Bernal Jimenez and Shu, Yiheng and Gu, Yu and Yasunaga, Michihiro and Su, Yu , title =. Advances in Neural Information Processing Systems (NeurIPS 2024) , year =
2024
-
[60]
Gutierrez, Bernal Jim. From. Proceedings of the 42nd International Conference on Machine Learning (ICML 2025) , year =
2025
-
[61]
Retrieval-Augmented Generation for Knowledge-Intensive
Lewis, Patrick and Perez, Ethan and Piktus, Aleksandra and Petroni, Fabio and Karpukhin, Vladimir and Goyal, Naman and K. Retrieval-Augmented Generation for Knowledge-Intensive. Advances in Neural Information Processing Systems (NeurIPS 2020) , year =
2020
-
[62]
and Salakhutdinov, Ruslan and Manning, Christopher D
Yang, Zhilin and Qi, Peng and Zhang, Saizheng and Bengio, Yoshua and Cohen, William W. and Salakhutdinov, Ruslan and Manning, Christopher D. , title =. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018) , pages =. 2018 , doi =
2018
-
[63]
Proceedings of COLING 2020 , pages =
Xanh Ho and Anh-Khoa Duong Nguyen and Saku Sugawara and Akiko Aizawa , title =. Proceedings of COLING 2020 , pages =. 2020 , doi =
2020
-
[64]
Transactions of the Association for Computational Linguistics , volume =
Harsh Trivedi and Niranjan Balasubramanian and Tushar Khot and Ashish Sabharwal , title =. Transactions of the Association for Computational Linguistics , volume =. 2022 , doi =
2022
-
[65]
Findings of ACL 2024 , pages =
Jianlyu Chen and Shitao Xiao and Peitian Zhang and Kun Luo and Defu Lian and Zheng Liu , title =. Findings of ACL 2024 , pages =. 2024 , doi =
2024
-
[66]
2024 , url =
Chen, Jianlv and Xiao, Shitao and Zhang, Peitian and Luo, Kun and Lian, Defu and Liu, Zheng , title =. 2024 , url =
2024
-
[67]
2024 , url =
OpenAI , title =. 2024 , url =
2024
-
[68]
Proceedings of NAACL 2022 , pages =
Keshav Santhanam and Omar Khattab and Jon Saad-Falcon and Christopher Potts and Matei Zaharia , title =. Proceedings of NAACL 2022 , pages =. 2022 , doi =
2022
-
[69]
Chi and Quoc V
Jason Wei and Xuezhi Wang and Dale Schuurmans and Maarten Bosma and Brian Ichter and Fei Xia and Ed H. Chi and Quoc V. Le and Denny Zhou , title =. NeurIPS 2022 , year =
2022
-
[70]
NeurIPS 2022 , year =
Takeshi Kojima and Shixiang Shane Gu and Machel Reid and Yutaka Matsuo and Yusuke Iwasawa , title =. NeurIPS 2022 , year =
2022
-
[71]
NAACL 2021 , pages =
Fabio Petroni and Aleksandra Piktus and Angela Fan and Patrick Lewis and Majid Yazdani and Nicola De Cao and James Thorne and Yacine Jernite and Vladimir Karpukhin and Jean Maillard and Vassilis Plachouras and Tim Rockt. NAACL 2021 , pages =. 2021 , doi =
2021
-
[72]
NAACL 2018 , pages =
Alon Talmor and Jonathan Berant , title =. NAACL 2018 , pages =. 2018 , doi =
2018
-
[73]
TACL , volume =
Johannes Welbl and Pontus Stenetorp and Sebastian Riedel , title =. TACL , volume =. 2018 , doi =
2018
-
[74]
Retrieval-Augmented Generation for Large Language Models: A Survey
Yunfan Gao and Yun Xiong and Xinyu Gao and Kangxiang Jia and Jinliu Pan and Yuxi Bi and Yi Dai and Jiawei Sun and Qianyu Guo and Meng Wang and Haofen Wang , title =. arXiv preprint arXiv:2312.10997 , year =. doi:10.48550/arXiv.2312.10997 , url =
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2312.10997
-
[75]
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG
Aditi Singh and Abul Ehtesham and Saket Kumar and Tala Talaei Khoei , title =. arXiv preprint arXiv:2501.09136 , year =. doi:10.48550/arXiv.2501.09136 , url =
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2501.09136
-
[76]
ACL 2024 , pages =
Weitao Li and Junkai Li and Weizhi Ma and Yang Liu , title =. ACL 2024 , pages =. 2024 , doi =
2024
-
[77]
Findings of EMNLP 2023 , pages =
Zhihong Shao and Yeyun Gong and Yelong Shen and Minlie Huang and Nan Duan and Weizhu Chen , title =. Findings of EMNLP 2023 , pages =. 2023 , doi =
2023
-
[78]
Liu and Kevin Lin and John Hewitt and Ashwin Paranjape and Michele Bevilacqua and Fabio Petroni and Percy Liang , title =
Nelson F. Liu and Kevin Lin and John Hewitt and Ashwin Paranjape and Michele Bevilacqua and Fabio Petroni and Percy Liang , title =. Transactions of the Association for Computational Linguistics , volume =. 2024 , doi =
2024
-
[79]
Hwang and Soumya Sanyal and Xiang Ren and Allyson Ettinger and Za
Nouha Dziri and Ximing Lu and Melanie Sclar and Xiang Lorraine Li and Liwei Jiang and Bill Yuchen Lin and Sean Welleck and Peter West and Chandra Bhagavatula and Ronan Le Bras and Jena D. Hwang and Soumya Sanyal and Xiang Ren and Allyson Ettinger and Za. Faith and Fate: Limits of Transformers on Compositionality , booktitle =. 2023 , url =
2023
-
[80]
Findings of EMNLP 2023 , pages =
Ziwei Ji and Tiezheng Yu and Yan Xu and Nayeon Lee and Etsuko Ishii and Pascale Fung , title =. Findings of EMNLP 2023 , pages =. 2023 , doi =
2023
-
[81]
ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models
Binfeng Xu and Zhiyuan Peng and Bowen Lei and Subhabrata Mukherjee and Yuchen Liu and Dongkuan Xu , title =. arXiv preprint arXiv:2305.18323 , year =. doi:10.48550/arXiv.2305.18323 , url =
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2305.18323
-
[82]
Findings of ACL 2024 , pages =
Shehzaad Dhuliawala and Mojtaba Komeili and Jing Xu and Roberta Raileanu and Xian Li and Asli Celikyilmaz and Jason Weston , title =. Findings of ACL 2024 , pages =. 2024 , doi =
2024
-
[83]
Rossi and Haoliang Wang and Julian J
Yu Xia and Junda Wu and Sungchul Kim and Tong Yu and Ryan A. Rossi and Haoliang Wang and Julian J. McAuley , title =. NAACL 2025 , pages =. 2025 , doi =
2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.