ISBN 979-8-89176-251-0

Zijun Yao, Weijian Qi, Liangming Pan, Shulin Cao, Linmei Hu, Liu Weichuan, Lei Hou, Juanzi Li · 2025 · DOI 10.18653/v1/2025.acl-long.1312

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation

cs.CL · 2025-10-09 · unverdicted · novelty 7.0

HiPRAG adds hierarchical process rewards to RL training for agentic RAG, reducing over-search to 2.3% and achieving 65.4-67.2% accuracy on seven QA benchmarks across 3B and 7B models.

When Confidence Takes the Wrong Path: Diagnosing Retrieval-State Lock-In in RAG

cs.CL · 2026-06-22 · unverdicted · novelty 6.0

Retrieval-state lock-in causes zero-dispersion errors in 42% of KG-RAG and 59% of dense-retrieval failures; a three-object check rule reaches 91.9% pooled precision at 7.7% coverage.

MASH: Modeling Abstention via Selective Help-Seeking

cs.CL · 2025-10-01 · unverdicted · novelty 6.0

MASH uses RL with a pay-per-search reward to make LLMs seek external help only when needed, improving multi-hop QA accuracy by 7.6% and enabling competitive abstention without pre-defined knowledge boundaries.

citing papers explorer

Showing 3 of 3 citing papers.

HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation cs.CL · 2025-10-09 · unverdicted · none · ref 26
HiPRAG adds hierarchical process rewards to RL training for agentic RAG, reducing over-search to 2.3% and achieving 65.4-67.2% accuracy on seven QA benchmarks across 3B and 7B models.
When Confidence Takes the Wrong Path: Diagnosing Retrieval-State Lock-In in RAG cs.CL · 2026-06-22 · unverdicted · none · ref 72
Retrieval-state lock-in causes zero-dispersion errors in 42% of KG-RAG and 59% of dense-retrieval failures; a three-object check rule reaches 91.9% pooled precision at 7.7% coverage.
MASH: Modeling Abstention via Selective Help-Seeking cs.CL · 2025-10-01 · unverdicted · none · ref 21
MASH uses RL with a pay-per-search reward to make LLMs seek external help only when needed, improving multi-hop QA accuracy by 7.6% and enabling competitive abstention without pre-defined knowledge boundaries.

ISBN 979-8-89176-251-0

fields

years

verdicts

representative citing papers

citing papers explorer