Reliable, Adaptable, and Attributable Language Models with Retrieval

Akari Asai; Danqi Chen; Hannaneh Hajishirzi; Luke Zettlemoyer; Pang Wei Koh; Wen-tau Yih; Zexuan Zhong

arxiv: 2403.03187 · v1 · pith:NZLTJ745new · submitted 2024-03-05 · 💻 cs.CL · cs.AI· cs.LG

Reliable, Adaptable, and Attributable Language Models with Retrieval

Akari Asai , Zexuan Zhong , Danqi Chen , Pang Wei Koh , Luke Zettlemoyer , Hannaneh Hajishirzi , Wen-tau Yih This is my paper

classification 💻 cs.CL cs.AIcs.LG

keywords retrieval-augmentedadaptableattributabledatadatastoresinferenceinfrastructureinteraction

0 comments

read the original abstract

Parametric language models (LMs), which are trained on vast amounts of web data, exhibit remarkable flexibility and capability. However, they still face practical challenges such as hallucinations, difficulty in adapting to new data distributions, and a lack of verifiability. In this position paper, we advocate for retrieval-augmented LMs to replace parametric LMs as the next generation of LMs. By incorporating large-scale datastores during inference, retrieval-augmented LMs can be more reliable, adaptable, and attributable. Despite their potential, retrieval-augmented LMs have yet to be widely adopted due to several obstacles: specifically, current retrieval-augmented LMs struggle to leverage helpful text beyond knowledge-intensive tasks such as question answering, have limited interaction between retrieval and LM components, and lack the infrastructure for scaling. To address these, we propose a roadmap for developing general-purpose retrieval-augmented LMs. This involves a reconsideration of datastores and retrievers, the exploration of pipelines with improved retriever-LM interaction, and significant investment in infrastructure for efficient training and inference.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

AttnTrace: Contextual Attribution of Prompt Injection and Knowledge Corruption
cs.CL 2025-08 unverdicted novelty 6.0

AttnTrace is an attention-weight-based context traceback method for LLMs that claims higher accuracy and efficiency than prior art like TracLLM while aiding prompt injection detection.
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions
cs.CL 2023-11 unverdicted novelty 5.0

The paper surveys hallucination in LLMs with an innovative taxonomy, factors, detection methods, benchmarks, mitigation strategies, and open research directions.
Mahalanobis-Guided Latent OOD Detection for Hybrid ES-DRL Control in Time-Varying Systems
cs.LG 2026-06 unverdicted novelty 3.0

A hybrid ES-DRL controller uses VAE latent Mahalanobis OOD detection to switch between RL and ES modes for time-varying nonlinear systems.