Title resolution pending

Differences in wording or language are allowed as long as the core answer is the same

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Search Self-play: Pushing the Frontier of Agent Capability without Supervision

cs.LG · 2025-10-21 · unverdicted · novelty 6.0

SSP trains search agents without supervision by co-evolving a task proposer and solver through self-play, with RAG verification ensuring ground-truth accuracy, yielding uniform gains on benchmarks in both from-scratch and continued RL settings.

Knowledge-Graph Paths as Intermediate Supervision for Self-Evolving Search Agents

cs.AI · 2026-05-07 · unverdicted · novelty 5.0

Knowledge-graph paths reused as intermediate supervision improve self-evolving search agents over standard Search Self-Play on seven QA benchmarks by supplying relational context and graded waypoint rewards.

citing papers explorer

Showing 2 of 2 citing papers.

Search Self-play: Pushing the Frontier of Agent Capability without Supervision cs.LG · 2025-10-21 · unverdicted · none · ref 15
SSP trains search agents without supervision by co-evolving a task proposer and solver through self-play, with RAG verification ensuring ground-truth accuracy, yielding uniform gains on benchmarks in both from-scratch and continued RL settings.
Knowledge-Graph Paths as Intermediate Supervision for Self-Evolving Search Agents cs.AI · 2026-05-07 · unverdicted · none · ref 16
Knowledge-graph paths reused as intermediate supervision improve self-evolving search agents over standard Search Self-Play on seven QA benchmarks by supplying relational context and graded waypoint rewards.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer