CA-BED uses Bayesian experimental design and simulated conversation trees with LLM likelihoods to optimize multi-turn question selection, reporting 21.8% higher success rates than direct prompting on entity-deduction benchmarks.
Ivan Stelmakh, Yi Luan, Bhuwan Dhingra, and Ming-Wei Chang
3 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 3representative citing papers
Reinforcement learning with a dual recall-precision reward trains models to enumerate valid interpretations and answers for ambiguous inputs using only multiple-answer supervision.
State-of-the-art LLMs respond inconsistently to queries from protected-group personas, with some responses omitting key information that should be provided.
citing papers explorer
-
CA-BED: Conversation-Aware Bayesian Experimental Design
CA-BED uses Bayesian experimental design and simulated conversation trees with LLM likelihoods to optimize multi-turn question selection, reporting 21.8% higher success rates than direct prompting on entity-deduction benchmarks.
-
Reasoning about Intent for Ambiguous Requests
Reinforcement learning with a dual recall-precision reward trains models to enumerate valid interpretations and answers for ambiguous inputs using only multiple-answer supervision.