Reinforced internal-external knowledge synergistic reasoning for efficient adaptive search agent,

· 2025 · arXiv 2505.07596

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

KbSD: Knowledge Boundary aware Self-Distillation for Behavioral Calibration in Agentic Search

cs.CL · 2026-06-29 · unverdicted · novelty 5.0

KbSD uses a same-size hint-augmented teacher and quadrant-adaptive KL objectives to deliver dense supervision for calibrated behavior across knowledge states in agentic search.

Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

cs.CL · 2026-06-10 · unverdicted · novelty 5.0

This survey categorizes agentic environments for LLMs by eight attributes and domains, introduces symbolic and neural synthesis paradigms with evaluation, and outlines four agent evolution pathways plus three environment evolution paradigms.

DocArena: Turning Raw Documents into Controllable Training Environments for Document Search Agents

cs.CV · 2026-05-27 · unverdicted · novelty 4.0

DocArena automates creation of multimodal document QA training data via MLLM-based structuring and cross-page reasoning pairs, yielding agents with top retrieval and QA performance in unified tests.

citing papers explorer

Showing 2 of 2 citing papers after filters.

KbSD: Knowledge Boundary aware Self-Distillation for Behavioral Calibration in Agentic Search cs.CL · 2026-06-29 · unverdicted · none · ref 7
KbSD uses a same-size hint-augmented teacher and quadrant-adaptive KL objectives to deliver dense supervision for calibrated behavior across knowledge states in agentic search.
Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application cs.CL · 2026-06-10 · unverdicted · none · ref 24
This survey categorizes agentic environments for LLMs by eight attributes and domains, introduces symbolic and neural synthesis paradigms with evaluation, and outlines four agent evolution pathways plus three environment evolution paradigms.

Reinforced internal-external knowledge synergistic reasoning for efficient adaptive search agent,

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer