Revisiting rag retrievers: An information theoretic benchmark.arXiv preprint arXiv:2602.21553, 2026

Wenqing Zheng, Dmitri Kalaev, Noah Fatsi, Daniel Barcklow, Owen Reinert, Igor Melnyk, Senthil Kumar, C Bayan Bruss · 2026 · arXiv 2602.21553

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

Quantifying Prior Dominance in RAG Systems

cs.CL · 2026-04-29 · unverdicted · novelty 7.0

Introduces NCU metric using token log-probabilities and finds small language models match or outperform larger ones in strict factual RAG extraction, while commercial APIs show high prior dominance and negative transfer.

How Many Tools Should an LLM Agent See? A Chance-Corrected Answer

cs.IR · 2026-05-23 · unverdicted · novelty 6.0

Bits-over-Random (BoR) is a chance-corrected metric for tool shortlist evaluation that enables query-adaptive depth selection via RL, matching fixed-list coverage with shorter lists on BFCL and ToolBench.

citing papers explorer

Showing 1 of 1 citing paper after filters.

How Many Tools Should an LLM Agent See? A Chance-Corrected Answer cs.IR · 2026-05-23 · unverdicted · none · ref 40
Bits-over-Random (BoR) is a chance-corrected metric for tool shortlist evaluation that enables query-adaptive depth selection via RL, matching fixed-list coverage with shorter lists on BFCL and ToolBench.

Revisiting rag retrievers: An information theoretic benchmark.arXiv preprint arXiv:2602.21553, 2026

fields

years

verdicts

representative citing papers

citing papers explorer