2501.04899v1 , archivePrefix =

Zubkova, Hanna, Park, Ji-Hoon, Lee, Seong-Whan , year = · 2018 · arXiv 2501.04899

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation

cs.CL · 2025-10-09 · unverdicted · novelty 7.0

HiPRAG adds hierarchical process rewards to RL training for agentic RAG, reducing over-search to 2.3% and achieving 65.4-67.2% accuracy on seven QA benchmarks across 3B and 7B models.

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search

cs.AI · 2026-05-28 · unverdicted · novelty 5.0

SAAS applies RL with boundary modeling via rollout contrasts, boundary-aware rewards, and staged optimization to reduce over-search in agentic LLMs while preserving accuracy.

Hybrid Adversarial Defence for Natural Language Understanding Tasks

cs.CL · 2026-06-03 · unverdicted · novelty 4.0

Hybrid entropy-uncertainty-geometric defence improves clean accuracy by up to 43% and adversarial robustness by up to 65% on NLU and security benchmarks.

citing papers explorer

Showing 2 of 2 citing papers after filters.

HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation cs.CL · 2025-10-09 · unverdicted · none · ref 30
HiPRAG adds hierarchical process rewards to RL training for agentic RAG, reducing over-search to 2.3% and achieving 65.4-67.2% accuracy on seven QA benchmarks across 3B and 7B models.
Hybrid Adversarial Defence for Natural Language Understanding Tasks cs.CL · 2026-06-03 · unverdicted · none · ref 54
Hybrid entropy-uncertainty-geometric defence improves clean accuracy by up to 43% and adversarial robustness by up to 65% on NLU and security benchmarks.

2501.04899v1 , archivePrefix =

fields

years

verdicts

representative citing papers

citing papers explorer