{"work":{"id":"1367ebdc-fa64-48ff-bca2-8f443dd58d57","openalex_id":null,"doi":null,"arxiv_id":"2407.13193","raw_key":null,"title":"Retrieval-Augmented Generation for Natural Language Processing: A Survey","authors":null,"authors_text":"Shangyu Wu, Ying Xiong, Yufei Cui, Haolun Wu, Can Chen, Ye Yuan, Lianming Huang, Xue Liu, Tei-Wei Kuo, Nan Guan, et al","year":2024,"venue":"cs.CL","abstract":"Large language models (LLMs) have achieved strong empirical performance in various fields, benefiting from their huge amount of parameters that store knowledge. However, LLMs still suffer from several key issues, such as hallucination problems, knowledge update issues, and lacking domain-specific expertise. The appearance of retrieval-augmented generation (RAG), which leverages an external knowledge base to augment LLMs, mitigates these limitations. This paper presents a systematic review of RAG techniques for natural language processing (NLP), with a focus on retrievers and retrieval fusions. We introduce a novel taxonomy of retrieval fusions, such as query-based, logits-based, latent, and parametric fusion, and provide structured comparisons across accessibility, efficiency, and use cases. The paper further examines RAG applications across diverse NLP tasks, discusses evaluation methodologies and benchmark limitations, and analyzes training paradigms with and without knowledge base updates. Finally, we explore industrial deployment considerations and identify emerging challenges and future directions, including security, efficiency, and graph-based retrieval.","external_url":"https://arxiv.org/abs/2407.13193","cited_by_count":null,"metadata_source":"pith","metadata_fetched_at":"2026-05-23T03:32:28.441666+00:00","pith_arxiv_id":"2407.13193","created_at":"2026-05-10T05:56:11.415352+00:00","updated_at":"2026-06-05T21:23:00.469572+00:00","title_quality_ok":true,"display_title":"Retrieval-Augmented Generation for Natural Language Processing: A Survey","render_title":"Retrieval-Augmented Generation for Natural Language Processing: A Survey"},"hub":{"state":{"work_id":"1367ebdc-fa64-48ff-bca2-8f443dd58d57","tier":"hub","tier_reason":"10+ Pith inbound or 1,000+ external citations","pith_inbound_count":10,"external_cited_by_count":null,"distinct_field_count":6,"first_pith_cited_at":"2025-02-14T03:28:36+00:00","last_pith_cited_at":"2026-05-12T16:17:03+00:00","author_build_status":"not_needed","summary_status":"needed","contexts_status":"needed","graph_status":"needed","ask_index_status":"not_needed","reader_status":"not_needed","recognition_status":"not_needed","updated_at":"2026-06-06T05:20:27.068728+00:00","tier_text":"hub"},"tier":"hub","role_counts":[{"context_role":"background","n":2}],"polarity_counts":[{"context_polarity":"background","n":2}],"runs":{},"summary":{},"graph":{},"authors":[]}}