Information gain-based policy optimization: A simple and effective approach for multi-turn LLM agents

10 Pith papers cite this work. Polarity classification is still indexing.

citation summary

years: 2026 (10)

roles: background (2)

polarities: background (2)

representative citing papers

Uncertainty-Aware Clarification in LLM Agents with Information Gain

cs.AI · 2026-06-02 · unverdicted · novelty 5.0

The paper introduces an Information Gain Reward to train clarification behavior in LLM agents, reporting a 3.7% success rate gain over no-clarification baselines in τ-Bench evaluations across five models with minimal added steps.
