pith. sign in

arxiv: 2606.00809 · v2 · pith:6YLPJMICnew · submitted 2026-05-30 · 💻 cs.AI

NBQ: Next-Best-Question for Dynamic Profiling

Pith reviewed 2026-06-28 18:32 UTC · model grok-4.3

classification 💻 cs.AI
keywords next-best-questiondynamic profilingconversational systemsuser modelingreciprocal matchinginformation gainvector retrieval
0
0 comments X

The pith

NBQ selects the question with highest expected information gain to build structured user profiles from free-form dialogue.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper addresses how an interviewer should choose the next question at each turn to maximize understanding of a person toward a specific goal. It presents NBQ as a framework that seeds candidate questions, tracks a compact user state, picks questions adaptively within a budget, and converts the dialogue into a vector profile. The method is instantiated for reciprocal matchmaking that models both self-description and counterpart preferences. QuickMatch is added to turn the matching task into fast approximate vector search. Experiments report gains in profiling metrics and large speedups in retrieval.

Core claim

At each turn an interviewer should ask the question with the highest expected information gain given what has already been learned and the conversation goal. NBQ seeds a diverse pool of candidate questions, maintains a compact and continuously updated user state, adaptively selects the next question within a turn budget, and distills the resulting free-form dialogue into a structured vector-based user profile. For reciprocal matchmaking compatibility is modeled by both self-description and counterpart-preference representations, and QuickMatch recasts pairwise scoring as approximate vector search.

What carries the argument

Expected information gain computed from a compact user state to choose the next question from a seeded diverse pool.

If this is right

  • Profiling quality rises by up to 13.6 percent in AC@T and 14.0 percent in AR@T.
  • Retrieval speed increases by up to 22.9 times while retaining recall of 0.989.
  • The same selection logic applies to other conversational knowledge-discovery settings such as hiring screens and podcasts.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Compact state representations may reduce the need to store full conversation histories, lowering storage and privacy costs.
  • The approach could be tested in domains where the profiling goal changes mid-conversation.
  • Seeding the question pool from domain-specific sources might be replaced by learned generation without breaking the rest of the pipeline.

Load-bearing premise

Expected information gain can be reliably estimated from the maintained compact user state and a seeded diverse pool of candidate questions is sufficient to cover the profiling goal across turns.

What would settle it

A controlled user study in which profiles built with NBQ-selected questions show no accuracy gain over random or fixed-order questions when measured by downstream task performance.

Figures

Figures reproduced from arXiv: 2606.00809 by Clarice Wang, Haixun Wang, Xiaokui Xiao, Yimin Shi.

Figure 1
Figure 1. Figure 1: NBQ workflow: question-pool construction, adaptive profiling, user representation, and reciprocal matching. (i) streaming updates, the representations 𝑠 and 𝑚 are refreshed im￾mediately upon each new (𝑞, 𝑎) pair; (ii) low latency, the ranked set TopK𝑡 (𝑖) must be refreshed within a fixed time bound, independent of the total user population; and (iii) scalability, both space and computation per refresh shou… view at source ↗
Figure 2
Figure 2. Figure 2: Recall and latency under varying 𝜃. the results. For the Non-transformed (Non-T) setting, we directly encode each user’s raw dialogue regarding self-information and mate-preference into embeddings, while the Transformed method applies the structured transformation introduced in Section 5.1 to summarize high-level insights and filter out superficial noises. As shown in [PITH_FULL_IMAGE:figures/full_fig_p00… view at source ↗
Figure 3
Figure 3. Figure 3: Question generation prompts. [System Prompts] You are an agent skilled at following instructions. You are working for a reciprocal matching system, where we aim to match individuals on two sides with each other. Each person on either side can be represented as x = (x.s, x.m), where x.s captures their self-information, and x.m captures their preferences for a potential partner (mate). More specifically, you… view at source ↗
Figure 4
Figure 4. Figure 4: Transformation-based representation prompts. [PITH_FULL_IMAGE:figures/full_fig_p014_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Generative relevance analysis prompts. [System Prompts] You are an agent skilled at following instructions. We have generated a set of questions for a respondent based on the following application scenario: scenario Each question is assumed to elicit information we are interested in about the person. We expect the respondent’s answer to include specific details about their opinions, values, attitudes, past… view at source ↗
Figure 6
Figure 6. Figure 6: Question analysis prompts [PITH_FULL_IMAGE:figures/full_fig_p015_6.png] view at source ↗
read the original abstract

Many real-world conversational settings for knowledge discovery, including podcasts, hiring screens, and marketplaces, require a purpose-driven understanding of a person. We study the Next-Best-Question (NBQ) problem: at each turn, an interviewer should ask the question with the highest expected information gain given what has already been learned and the conversation goal. We propose NBQ, a plug-and-play framework that seeds a diverse pool of candidate questions, maintains a compact and continuously updated user state, adaptively selects the next question within a turn budget, and distills the resulting free-form dialogue into a structured vector-based user profile. As a demanding application, we instantiate NBQ for reciprocal matchmaking, where compatibility must be mutual and each person is modeled by both self-description and counterpart-preference representations. To support large-scale matching, we further introduce QuickMatch, an efficient retrieval layer that recasts reciprocal matching from quadratic pairwise scoring to approximate vector search. Experiments show that NBQ improves user profiling quality by up to 13.6% and 14.0% in AC@T and AR@T, respectively, while QuickMatch accelerates retrieval by up to 22.9x with recall up to 0.989.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper introduces the Next-Best-Question (NBQ) problem for purpose-driven conversational profiling and proposes a plug-and-play framework that seeds a diverse candidate question pool, maintains a compact continuously updated user state, adaptively selects questions to maximize expected information gain within a turn budget, and distills free-form dialogue into a structured vector profile. It instantiates the approach for reciprocal matchmaking (modeling both self-description and counterpart preferences) and introduces QuickMatch to reduce quadratic pairwise scoring to approximate vector search. Experiments report up to 13.6% and 14.0% gains in AC@T and AR@T profiling metrics plus 22.9x retrieval speedup at recall 0.989.

Significance. If the central claims hold, the work offers a practical, modular approach to dynamic profiling that combines information-theoretic question selection with scalable retrieval; this could be useful for conversational applications in hiring, marketplaces, and knowledge discovery. The explicit separation of state maintenance, selection, and distillation plus the efficiency layer are positive design choices.

major comments (3)
  1. [Abstract / NBQ framework] Abstract and framework description: the headline AC@T/AR@T gains rest on the claim that expected information gain can be reliably estimated from the maintained compact user state. No ablation or comparison to a full-history oracle is reported that would confirm the compact representation preserves sufficient signal across multiple turns; without this, the adaptive selection benefit cannot be isolated from the assumption.
  2. [Experiments] Experiments section (profiling results): the reported 13.6% and 14.0% improvements are stated without error bars, statistical significance tests, or breakdown by number of turns. It is therefore unclear whether the gains are robust or driven by particular dataset characteristics or the seeded pool diversity.
  3. [QuickMatch] QuickMatch description: the reduction from quadratic to approximate vector search is presented as exact in recall@0.989, but no analysis is given of how the reciprocal (mutual) constraint is preserved under the approximation or of failure cases where high recall still yields poor matchmaking quality.
minor comments (2)
  1. [Experiments] Notation for AC@T and AR@T is introduced without an explicit equation or definition in the main text; a short formal definition would improve reproducibility.
  2. [NBQ framework] The seeded candidate pool is described as 'diverse' but no quantitative measure of pool coverage or diversity (e.g., embedding variance or coverage of profiling dimensions) is provided.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their thoughtful and constructive comments. We address each major comment point-by-point below and indicate planned revisions to the manuscript.

read point-by-point responses
  1. Referee: [Abstract / NBQ framework] Abstract and framework description: the headline AC@T/AR@T gains rest on the claim that expected information gain can be reliably estimated from the maintained compact user state. No ablation or comparison to a full-history oracle is reported that would confirm the compact representation preserves sufficient signal across multiple turns; without this, the adaptive selection benefit cannot be isolated from the assumption.

    Authors: We appreciate the referee highlighting this point. The compact state is designed to retain the information necessary for estimating expected information gain, but we agree that an explicit ablation against a full-history oracle would strengthen the isolation of the adaptive selection benefit. We will add this comparison in the revised manuscript. revision: yes

  2. Referee: [Experiments] Experiments section (profiling results): the reported 13.6% and 14.0% improvements are stated without error bars, statistical significance tests, or breakdown by number of turns. It is therefore unclear whether the gains are robust or driven by particular dataset characteristics or the seeded pool diversity.

    Authors: We agree that additional statistical rigor and breakdowns would improve clarity. In the revised version we will report error bars, include statistical significance tests, and provide results broken down by number of turns. revision: yes

  3. Referee: [QuickMatch] QuickMatch description: the reduction from quadratic to approximate vector search is presented as exact in recall@0.989, but no analysis is given of how the reciprocal (mutual) constraint is preserved under the approximation or of failure cases where high recall still yields poor matchmaking quality.

    Authors: We acknowledge the value of analyzing how the reciprocal constraint behaves under approximation. We will add a discussion of this preservation mechanism together with an examination of potential failure cases in the revised QuickMatch section. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical gains reported from proposed framework without reduction to inputs by construction

full rationale

The paper describes a plug-and-play NBQ framework that seeds a candidate pool, maintains a compact user state, selects via expected information gain, and distills to a vector profile, then reports experimental improvements (up to 13.6% AC@T, 14.0% AR@T) and QuickMatch speedups. No equations, derivations, or self-citations are shown that make any claimed result equivalent to its inputs by definition, rename a fitted quantity as a prediction, or rely on load-bearing self-citation chains. The central claims rest on external experimental validation rather than internal self-reference, making the derivation self-contained against the listed circularity patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no free parameters, axioms, or invented entities are described in sufficient detail to populate the ledger.

pith-pipeline@v0.9.1-grok · 5749 in / 1204 out tokens · 30176 ms · 2026-06-28T18:32:38.367827+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

46 extracted references · 8 canonical work pages · 2 internal anchors

  1. [1]

    Zhiliang Chen, Xinyuan Niu, Chuan-Sheng Foo, and Bryan Kian Hsiang Low

  2. [2]

    InInternational Conference on Learning Representations (ICLR ’25), Spotlight

    Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs with Semantic Space. InInternational Conference on Learning Representations (ICLR ’25), Spotlight. https://openreview.net/forum?id=3cgMU3TyyE

  3. [3]

    Matthijs Douze, Alexandr Guzhva, Chengqi Deng, Jeff Johnson, Gergely Szilvasy, Pierre-Emmanuel Mazaré, Maria Lomeli, Lucas Hosseini, and Hervé Jégou. 2024. The faiss library.arXiv preprint arXiv:2401.08281(2024)

  4. [4]

    Naghmeh Farzi and Laura Dietz. 2025. Criteria-Based LLM Relevance Judgments. InProceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR). 254–263

  5. [5]

    Aristides Gionis, Piotr Indyk, Rajeev Motwani, et al. 1999. Similarity search in high dimensions via hashing. InVldb, Vol. 99. 518–529

  6. [6]

    Yunchao Gong, Svetlana Lazebnik, Albert Gordo, and Florent Perronnin. 2012. Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval.IEEE transactions on pattern analysis and machine intelligence35, 12 (2012), 2916–2929

  7. [7]

    Baidu Inc. 2023. PUCK: A High-Performance Approximate Nearest Neighbor Search Library. https://github.com/baidu/puck

  8. [8]

    Herve Jegou, Matthijs Douze, and Cordelia Schmid. 2010. Product quantization for nearest neighbor search.IEEE transactions on pattern analysis and machine intelligence33, 1 (2010), 117–128

  9. [9]

    Jeff Johnson, Matthijs Douze, and Hervé Jégou. 2019. Billion-scale similarity search with GPUs.IEEE Transactions on Big Data7, 3 (2019), 535–547

  10. [10]

    Andrew Kane. 2021. pgvector: Open-Source Vector Similarity Search Extension for PostgreSQL. https://github.com/pgvector/pgvector

  11. [11]

    Sara Kemper, Justin Cui, Kai Dicarlantonio, Kathy Lin, Danjie Tang, Anton Ko- rikov, and Scott Sanner. 2024. Retrieval-Augmented Conversational Recommen- dation with Prompt-based Semi-Structured Natural Language State Tracking. InProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’24). AC...

  12. [12]

    Heeseon Kim, Hyeongjun Yang, and Kyong Ho Lee. 2023. Confident Action Decision via Hierarchical Policy Learning for Conversational Recommendation. InProceedings of the ACM Web Conference 2023 (WWW ’23). ACM, Austin, TX, USA, 1386–1395. doi:10.1145/3543507.3583536

  13. [13]

    Kai-Huang Lai, Zhe-Rui Yang, Pei-Yuan Lai, Chang-Dong Wang, Mohsen Guizani, and Min Chen. 2024. Knowledge-Aware Explainable Reciprocal Recommendation. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 38. 8636–8644. doi:10.1609/aaai.v38i8.28708

  14. [14]

    Belinda Z Li, Alex Tamkin, Noah Goodman, and Jacob Andreas. 2025. Elicit- ing Human Preferences with Language Models. InThe Thirteenth International Conference on Learning Representations

  15. [15]

    Allen Lin, Ziwei Zhu, Jianling Wang, and James Caverlee. 2023. Enhancing User Personalization in Conversational Recommenders. InProceedings of the ACM Web Conference 2023 (WWW ’23). doi:10.48550/arXiv.2302.06656 Extended version: arXiv:2302.06656

  16. [16]

    Jiarui Luo, Miao Qiao, Chaoji Zuo, and Dong Deng. 2025. Tag-Filtered Approxi- mate Nearest Neighbor Search. In2025 IEEE 41st International Conference on Data Engineering (ICDE). IEEE, 3642–3654

  17. [17]

    Yu A Malkov and Dmitry A Yashunin. 2018. Efficient and robust approximate nearest neighbor search using hierarchical navigable small world graphs.IEEE transactions on pattern analysis and machine intelligence42, 4 (2018), 824–836

  18. [18]

    Behnam Neyshabur and Nathan Srebro. 2015. On symmetric and asymmetric lshs for inner product search. InInternational Conference on Machine Learning. PMLR, 1926–1934

  19. [19]

    Hiroyuki Ootomo, Akira Naruse, Corey Nolet, Ray Wang, Tamas Feher, and Yong Wang. 2024. Cagra: Highly parallel graph construction and approximate nearest neighbor search for gpus. In2024 IEEE 40th International Conference on Data Engineering (ICDE). IEEE, 4236–4247

  20. [20]

    James Jie Pan, Jianguo Wang, and Guoliang Li. 2024. Survey of vector database management systems.The VLDB Journal33, 5 (2024), 1591–1615

  21. [21]

    Luiz Pizzato, Tomek Rej, Thomas Chung, Irena Koprinska, and Judy Kay. 2010. RECON: a reciprocal recommender for online dating. InProceedings of the fourth ACM conference on Recommender systems. 207–214

  22. [22]

    Inc. Redis. [n. d.]. Redis as a Vector Database. https://redis.io/docs/latest/develop/ get-started/vector-database/

  23. [23]

    Burr Settles. 2009. Active learning literature survey. (2009)

  24. [24]

    Anshumali Shrivastava and Ping Li. 2014. Asymmetric LSH (ALSH) for sublinear time maximum inner product search (MIPS).Advances in neural information processing systems27 (2014)

  25. [25]

    Harsha Vardhan Simhadri, George Williams, Martin Aumüller, Matthijs Douze, Artem Babenko, Dmitry Baranchuk, Qi Chen, Lucas Hosseini, Ravishankar Krish- naswamny, Gopal Srinivasa, et al. 2022. Results of the NeurIPS’21 challenge on billion-scale approximate nearest neighbor search. InNeurIPS 2021 Competitions and Demonstrations Track. PMLR, 177–189

  26. [26]

    Yitong Song, Kai Wang, Bin Yao, Zhida Chen, Jiong Xie, and Feifei Li. 2024. Effi- cient Reverse 𝑘 Approximate Nearest Neighbor Search Over High-Dimensional Vectors. In2024 IEEE 40th International Conference on Data Engineering (ICDE). IEEE, 4262–4274

  27. [27]

    Hongda Sun, Hongzhan Lin, Haiyu Yan, Yang Song, Xin Gao, and Rui Yan. 2025. MockLLM: A Multi-Agent Behavior Collaboration Framework for Online Job Seeking and Recruiting. InProceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2. 2714–2724

  28. [28]

    Ji Sun, Guoliang Li, James Pan, Jiang Wang, Yongqing Xie, Ruicheng Liu, and Wen Nie. 2025. GaussDB-Vector: A Large-Scale Persistent Real-Time Vector Database for LLM Applications.Proceedings of the VLDB Endowment18, 12 (2025), 4951–4963

  29. [29]

    Yufei Tao, Ke Yi, Cheng Sheng, and Panos Kalnis. 2009. Quality and efficiency in high dimensional nearest neighbor search. InProceedings of the 2009 ACM SIGMOD International Conference on Management of data. 563–576

  30. [30]

    Paul Thomas, Seth Spielman, Nick Craswell, and Bhaskar Mitra. 2024. Large language models can accurately predict searcher preferences. InProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1930–1940

  31. [31]

    Katherine Tian, Eric Mitchell, Allan Zhou, Archit Sharma, Rafael Rafailov, Huaxiu Yao, Chelsea Finn, and Christopher D Manning. 2023. Just ask for calibration: Strategies for eliciting calibrated confidence scores from language models fine- tuned with human feedback.arXiv preprint arXiv:2305.14975(2023)

  32. [32]

    Yao Tian, Xi Zhao, and Xiaofang Zhou. 2022. DB-LSH: Locality-Sensitive Hashing with Query-based Dynamic Bucketing. In2022 IEEE 38th International Conference on Data Engineering (ICDE). IEEE, 2250–2262

  33. [33]

    Clarice Wang, Yimin Shi, and Xiaokui Xiao. 2025. A Framework for Evaluating AI Agents in Open-Ended Conversations via Scripted Simulation. InProceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V

  34. [34]

    Jianguo Wang, Xiaomeng Yi, Rentong Guo, Hai Jin, Peng Xu, Shengjun Li, Xi- angyu Wang, Xiangzhou Guo, Chengming Li, Xiaohai Xu, et al. 2021. Milvus: A purpose-built vector data management system. InProceedings of the 2021 international conference on management of data. 2614–2627

  35. [35]

    Xingyao Wang, Zihan Wang, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng, and Heng Ji. 2024. MINT: EVALUATING LLMS IN MULTI-TURN INTERACTION WITH TOOLS AND LANGUAGE FEEDBACK. In12th International Conference on Learning Representations, ICLR 2024

  36. [36]

    Chuangxian Wei, Bin Wu, Sheng Wang, Renjie Lou, Chaoqun Zhan, Feifei Li, and Yuanzhe Cai. 2020. Analyticdb-v: A hybrid analytical engine towards query fusion for structured and unstructured data.Proceedings of the VLDB Endowment 13, 12 (2020), 3152–3165

  37. [37]

    Peng Xia, Shuangfei Zhai, Benyuan Liu, Yizhou Sun, and Cindy Chen. 2016. Design of Reciprocal Recommendation Systems for Online Dating.Social Network Analysis and Mining6, 1 (2016), 1–17. NBQ: Next-Best-Question for Dynamic Profiling KDD ’26, August 09–13, 2026, Jeju Island, Republic of Korea

  38. [38]

    Jiadong Xie, Jeffrey Xu Yu, and Yingfan Liu. 2025. Fast Approximate Similarity Join in Vector Databases.Proceedings of the ACM on Management of Data3, 3 (2025), 1–26

  39. [39]

    Miao Xiong, Zhiyuan Hu, Xinyang Lu, Yifei Li, Jie Fu, Junxian He, and Bryan Hooi

  40. [40]

    Can llms express their uncertainty? an empirical evaluation of confidence elicitation in llms.arXiv preprint arXiv:2306.13063(2023)

  41. [41]

    Lu Zhang, Chen Li, Yu Lei, Zhu Sun, and Guanfeng Liu. 2024. An empirical analysis on multi-turn conversational recommender systems. InProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval. 841–851

  42. [42]

    Yizhe Zhang, Jiarui Lu, and Navdeep Jaitly. 2024. Probing the Multi-turn Planning Capabilities of LLMs via 20 Question Games. InProceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 1495–1516

  43. [43]

    Weijie Zhao, Shulong Tan, and Ping Li. 2020. Song: Approximate nearest neighbor search on gpu. In2020 IEEE 36th International Conference on Data Engineering (ICDE). IEEE, 1033–1044

  44. [44]

    Xi Zhao, Zhonghan Chen, Kai Huang, Ruiyuan Zhang, Bolong Zheng, and Xiao- fang Zhou. 2024. Efficient Approximate Maximum Inner Product Search Over Sparse Vectors. In2024 IEEE 40th International Conference on Data Engineering (ICDE). IEEE, 3961–3974

  45. [45]

    Bowen Zheng, Yupeng Hou, Wayne Xin Zhao, Yang Song, and Hengshu Zhu

  46. [46]

    Quiet Companionability Seeker

    Reciprocal Sequential Recommendation. InProceedings of the 17th ACM Conference on Recommender Systems (RecSys). —. doi:10.1145/3604915.3608798 A Proof of Theorems A.1 Proof of Theorem 1 Theorem 1.Let 𝑓𝑖 𝑗, 𝑓 𝑗𝑖 ∈ ( 0, 1]. If 𝑔(𝑢𝑖, 𝑢𝑗 ) ≥𝜃 , then necessarily [¯s𝑖; ¯m𝑖 ] · [ ¯m𝑗 ; ¯s𝑗 ] ≥4𝜃−2where ¯m= m ∥m∥ and ¯s= s ∥s∥ . Proof.Assume𝑔(𝑢 𝑖, 𝑢𝑗 ) ≥𝜃. By the...