The authors introduce the task of asking clarifying questions for open-domain information-seeking conversations, collect the Qulac dataset from TREC topics, and propose a retrieval framework that outperforms baselines with an oracle showing 170% P@1 gain.
Bruce Croft
2 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 2representative citing papers
MirrorBench defines a reproducible benchmark combining lexical metrics (MATTR, Yule's K, HD-D) and LLM-judge metrics with calibration controls to measure human-likeness of user-proxy agents across four datasets.
citing papers explorer
-
Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
The authors introduce the task of asking clarifying questions for open-domain information-seeking conversations, collect the Qulac dataset from TREC topics, and propose a retrieval framework that outperforms baselines with an oracle showing 170% P@1 gain.
-
MirrorBench: A Benchmark to Evaluate Conversational User-Proxy Agents for Human-Likeness
MirrorBench defines a reproducible benchmark combining lexical metrics (MATTR, Yule's K, HD-D) and LLM-judge metrics with calibration controls to measure human-likeness of user-proxy agents across four datasets.