IntentTune: Using user demand and personalization to resolve "unknown" query intents for e-commerce search

Chester Palen-Michel; Ishita Khan; Jayanth Yetukuri; Mehran Elyasi; Rachith Aiyappa; Samarth Agrawal; Shuang Zhou

arxiv: 2607.01530 · v1 · pith:S4VRIHGBnew · submitted 2026-07-01 · 💻 cs.IR · cs.AI

IntentTune: Using user demand and personalization to resolve "unknown" query intents for e-commerce search

Rachith Aiyappa , Ishita Khan , Chester Palen-Michel , Jayanth Yetukuri , Samarth Agrawal , Mehran Elyasi , Shuang Zhou This is my paper

Pith reviewed 2026-07-03 18:08 UTC · model grok-4.3

classification 💻 cs.IR cs.AI

keywords query intent detectione-commerce searchpersonalizationuser behavior signalsambiguous queriesintent inferencesearch history

0 comments

The pith

User-specific search history infers gender, age, category and size intent from ambiguous e-commerce queries more reliably than population statistics or static profiles.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that many real-world product searches are too short to specify attributes such as gender or age group, so systems must infer those attributes from other signals. It shows that population-wide demand patterns alone leave too much ambiguity, while a given user's prior queries and browsing activity supply stronger, individualized cues. Experiments on production e-commerce traffic confirm that these user-specific signals improve inference accuracy across four intent dimensions. If the result holds, retrieval systems could return relevant results without forcing users to add clarifying terms.

Core claim

IntentTune resolves under-specified queries by combining two sources of evidence: population-level demand patterns aggregated across all users, and user-specific behavioral signals that include search history, browsing activity, and profile attributes. On real-world data the population patterns prove insufficient; user-specific signals, especially prior search queries, deliver higher accuracy when predicting gender, age group, product category, and size from the same underspecified queries.

What carries the argument

IntentTune framework that routes an ambiguous query through either aggregated demand statistics or per-user behavioral history to predict latent attributes.

If this is right

Retrieval pipelines can use recent user queries as an additional feature when ranking results for short queries.
Systems may reduce the fraction of sessions that end in reformulation by pre-resolving gender or size intent before the first result page.
Static profile fields become less critical once recent behavioral history is available.
Population-level statistics remain useful as a fallback when user history is absent or sparse.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If history-based intent signals prove stable across sessions, query suggestion engines could proactively surface attribute-specific refinements.
The same user-history approach might transfer to other underspecified search domains such as recipe or travel queries where latent constraints are common.
Production deployment would require handling cases where history is missing or privacy-restricted, potentially by blending with population patterns.

Load-bearing premise

The collected user-behavior dataset is representative of live traffic and contains accurate, low-noise signals that remain available in production.

What would settle it

Retraining and testing the same models on a second e-commerce corpus that lacks per-user query histories or contains only population aggregates; accuracy should fall to the population-only baseline.

Figures

Figures reproduced from arXiv: 2607.01530 by Chester Palen-Michel, Ishita Khan, Jayanth Yetukuri, Mehran Elyasi, Rachith Aiyappa, Samarth Agrawal, Shuang Zhou.

**Figure 1.** Figure 1: Overview of the IntentTune framework. Queries that are labeled as “unspecified” by existing intent models [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗

**Figure 2.** Figure 2: Historical-query-based personalization (right; [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗

read the original abstract

Understanding user intent is fundamental to delivering relevant search results in e-commerce. However, substantial fraction of real-world queries are under-specified (e.g., "watch" or "shirt"), lacking explicit attributes such as gender or age group. This ambiguity poses a significant challenge for query intent detection models in e-commerce search systems, which must accurately infer latent user intent (e.g., age, gender) to support effective downstream retrieval. We introduce IntentTune, a framework for resolving ambiguous or under-specified query intents by leveraging either (1) user-specific behavioral signals including search history, browsing activity, and profile attributes or (2) population-level demand patterns aggregated across all users. Through experiments on real-world e-commerce data, we first demonstrate that population-level demand patterns alone are insufficient to reliably infer intent in under-specified queries. We then demonstrate that user-specific behavioral signals -- particularly prior search queries -- outperform both population-level statistics and static profile information for inferring gender, age group, product category, and size intent from underspecified queries.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

User search history beats population patterns and static profiles at guessing intent on vague e-commerce queries, but the abstract supplies no methods or metrics to judge the experiments.

read the letter

The core finding is that prior user queries give a clearer signal than aggregated demand or profile data when filling in missing attributes like gender, age, category, or size on short queries. The paper sets up IntentTune to test both routes on real e-commerce logs and reports that the personal route wins.

That comparison is the useful part. It directly addresses a common production issue where a large share of traffic is underspecified, and it shows population-level stats fall short on their own. The four intent dimensions are a reasonable scope for the claim.

The main limitation is the lack of any description of the actual setup. No metrics, no baseline details, no sample sizes, no controls for query frequency or user activity level. Without those, the outperformance claim cannot be evaluated. The dataset is described only as real-world, so questions about noise, privacy filtering, or how representative it is remain open.

This work is aimed at practitioners building e-commerce search stacks who already have access to user logs. Someone in that setting could extract a practical takeaway if the full paper supplies the missing experimental controls. A reader outside that niche will find little to take away.

It should go to peer review. The empirical direction is clear enough and the problem is real; referees can check whether the numbers actually support the headline result.

Referee Report

1 major / 0 minor

Summary. The manuscript introduces IntentTune, a framework for resolving under-specified query intents in e-commerce search (e.g., inferring gender, age group, product category, or size from queries like "watch" or "shirt") by leveraging either user-specific behavioral signals (search history, browsing activity, profile attributes) or population-level demand patterns. It claims that experiments on real-world e-commerce data demonstrate population-level patterns are insufficient and that user-specific signals, particularly prior search queries, outperform both population-level statistics and static profile information.

Significance. If the empirical results hold with proper validation, the work addresses a practical challenge in e-commerce search by showing the value of personalization over aggregate statistics for intent inference, which could improve retrieval relevance for ambiguous queries.

major comments (1)

Abstract: The abstract asserts experimental outperformance of user-specific behavioral signals but supplies no information on methods, metrics, controls, sample sizes, or statistical tests, making it impossible to assess whether the data actually support the claim as stated.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback on the abstract. We agree that additional details would strengthen the presentation of our claims and will revise the manuscript accordingly.

read point-by-point responses

Referee: [—] Abstract: The abstract asserts experimental outperformance of user-specific behavioral signals but supplies no information on methods, metrics, controls, sample sizes, or statistical tests, making it impossible to assess whether the data actually support the claim as stated.

Authors: We agree that the abstract is high-level and omits specific experimental details. In the revised version we will expand the abstract to reference the primary metrics (accuracy and F1-score), note the use of large-scale real-world e-commerce logs with user behavioral signals, and state that statistical significance testing was performed. Full descriptions of methods, controls, sample sizes, and results remain in Sections 4–5; the abstract revision will be kept concise to respect length limits while improving assessability. revision: yes

Circularity Check

0 steps flagged

No significant circularity; purely empirical claim

full rationale

The paper presents an empirical comparison on real-world e-commerce data showing that user-specific behavioral signals (especially prior queries) outperform population-level statistics and static profiles for inferring latent intents from underspecified queries. No equations, derivations, fitted parameters renamed as predictions, or self-citation chains appear in the provided text. The central claim rests on experimental results rather than any self-referential construction, making the argument self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no details on free parameters, axioms, or invented entities can be extracted.

pith-pipeline@v0.9.1-grok · 5734 in / 1101 out tokens · 22140 ms · 2026-07-03T18:08:34.080197+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

48 extracted references · 17 canonical work pages · 2 internal anchors

[1]

Companion of the 2024 International Conference on Management of Data , pages =

Yu, Changlong and Liu, Xin and Maia, Jefferson and Li, Yang and Cao, Tianyu and Gao, Yifan and Song, Yangqiu and Goutam, Rahul and Zhang, Haiyang and Yin, Bing and Li, Zheng , title =. Companion of the 2024 International Conference on Management of Data , pages =. 2024 , isbn =. doi:10.1145/3626246.3653398 , abstract =

work page doi:10.1145/3626246.3653398 2024
[2]

FABRIC : Fully-Automated Broad Intent Categorization in E -commerce

Tigunova, Anna and Schmidt, Philipp and Akcora, Damla Ezgi. FABRIC : Fully-Automated Broad Intent Categorization in E -commerce. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track. 2025. doi:10.18653/v1/2025.emnlp-industry.29

work page doi:10.18653/v1/2025.emnlp-industry.29 2025
[3]

Generative Models for Product Attribute Extraction

Blume, Ansel and Zalmout, Nasser and Ji, Heng and Li, Xian. Generative Models for Product Attribute Extraction. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track. 2023. doi:10.18653/v1/2023.emnlp-industry.55

work page doi:10.18653/v1/2023.emnlp-industry.55 2023
[4]

Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , pages =

Luo, Xusheng and Bo, Le and Wu, Jinhang and Li, Lin and Luo, Zhiy and Yang, Yonghua and Yang, Keping , title =. Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , pages =. 2021 , isbn =. doi:10.1145/3447548.3467203 , abstract =

work page doi:10.1145/3447548.3467203 2021
[5]

2025 , eprint=

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory , author=. 2025 , eprint=

2025
[6]

ArXiv , year=

PersonaMem-v2: Towards Personalized Intelligence via Learning Implicit User Personas and Agentic Memory , author=. ArXiv , year=
[7]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages =

Luo, Chen and Goutam, Rahul and Zhang, Haiyang and Zhang, Chao and Song, Yangqiu and Yin, Bing , title =. Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages =. 2023 , isbn =. doi:10.1145/3539618.3591858 , abstract =

work page doi:10.1145/3539618.3591858 2023
[8]

Explicit Attribute Extraction in e-Commerce Search

Loughnane, Robyn and Liu, Jiaxin and Chen, Zhilin and Wang, Zhiqi and Giroux, Joseph and Du, Tianchuan and Schroeder, Benjamin and Sun, Weiyi. Explicit Attribute Extraction in e-Commerce Search. Proceedings of the Seventh Workshop on e-Commerce and NLP @ LREC-COLING 2024. 2024

2024
[9]

Personalizing Dialogue Agents: I have a dog, do you have pets too?

Zhang, Saizheng and Dinan, Emily and Urbanek, Jack and Szlam, Arthur and Kiela, Douwe and Weston, Jason. Personalizing Dialogue Agents: I have a dog, do you have pets too?. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2018. doi:10.18653/v1/P18-1205

work page doi:10.18653/v1/p18-1205 2018
[10]

Bruce , title =

Ai, Qingyao and Zhang, Yongfeng and Bi, Keping and Chen, Xu and Croft, W. Bruce , title =. Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages =. 2017 , isbn =. doi:10.1145/3077136.3080813 , abstract =

work page doi:10.1145/3077136.3080813 2017
[11]

MIND : A Large-scale Dataset for News Recommendation

Wu, Fangzhao and Qiao, Ying and Chen, Jiun-Hung and Wu, Chuhan and Qi, Tao and Lian, Jianxun and Liu, Danyang and Xie, Xing and Gao, Jianfeng and Wu, Winnie and Zhou, Ming. MIND : A Large-scale Dataset for News Recommendation. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020. doi:10.18653/v1/2020.acl-main.331

work page doi:10.18653/v1/2020.acl-main.331 2020
[12]

Proceedings of the 7th ACM International Conference on Web Search and Data Mining , pages =

Serdyukov, Pavel and Dupret, Georges and Craswell, Nick , title =. Proceedings of the 7th ACM International Conference on Web Search and Data Mining , pages =. 2014 , isbn =. doi:10.1145/2556195.2556207 , abstract =

work page doi:10.1145/2556195.2556207 2014
[13]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '23), July 23--27, 2023, Taipei, Taiwan , series =

Bernard, Nolwenn and Balog, Krisztian , title =. Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '23), July 23--27, 2023, Taipei, Taiwan , series =

2023
[14]

P ersona L ens: A Benchmark for Personalization Evaluation in Conversational AI Assistants

Zhao, Zheng and Vania, Clara and Kayal, Subhradeep and Khan, Naila and Cohen, Shay B and Yilmaz, Emine. P ersona L ens: A Benchmark for Personalization Evaluation in Conversational AI Assistants. Findings of the Association for Computational Linguistics: ACL 2025. 2025. doi:10.18653/v1/2025.findings-acl.927

work page doi:10.18653/v1/2025.findings-acl.927 2025
[15]

and Radlinski, Filip and White, Ryen W

Bennett, Paul N. and Radlinski, Filip and White, Ryen W. and Yilmaz, Emine , title =. Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages =. 2011 , isbn =. doi:10.1145/2009916.2009938 , abstract =

work page doi:10.1145/2009916.2009938 2011
[16]

Companion Proceedings of the ACM Web Conference 2023 , pages=

Knowledge Graph-Enhanced Neural Query Rewriting , author=. Companion Proceedings of the ACM Web Conference 2023 , pages=. 2023 , url=

2023
[17]

Yetukuri, Jayanth and Khan, Ishita , title =. Proceedings of the ACM SIGIR Workshop on eCommerce 2025 co-located with the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025) , year =

2025
[18]

Proceedings of the 2021 Conference on Human Information Interaction and Retrieval , pages=

Dataset of Natural Language Queries for E-Commerce , author=. Proceedings of the 2021 Conference on Human Information Interaction and Retrieval , pages=. 2021 , url=

2021
[19]

A Chain-of-Thought Approach to Semantic Query Categorization in e-Commerce Taxonomies , author=. Proceedings of the ACM SIGIR Workshop on eCommerce 2025 co-located with the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025) , year=

2025
[20]

Distributed Word Representations Improve NER for e-Commerce

Joshi, Mahesh and Hart, Ethan and Vogel, Mirko and Ruvini, Jean-David. Distributed Word Representations Improve NER for e-Commerce. Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing. 2015. doi:10.3115/v1/W15-1522

work page doi:10.3115/v1/w15-1522 2015
[21]

Q uery NER : Segmentation of E -commerce Queries

Palen-Michel, Chester and Liang, Lizzie and Wu, Zhe and Lignos, Constantine. Q uery NER : Segmentation of E -commerce Queries. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). 2024

2024
[22]

E -Commerce Product Categorization with LLM -based Dual-Expert Classification Paradigm

Cheng, Zhu and Zhang, Wen and Chou, Chih-Chi and Jau, You-Yi and Pathak, Archita and Gao, Peng and Batur, Umit. E -Commerce Product Categorization with LLM -based Dual-Expert Classification Paradigm. Proceedings of the 1st Workshop on Customizable NLP: Progress and Challenges in Customizing NLP for a Domain, Application, Group, or Individual (CustomNLP4U)...

work page doi:10.18653/v1/2024.customnlp4u-1.22 2024
[23]

2020 , publisher=

Query understanding for search engines , author=. 2020 , publisher=

2020
[24]

ACM Computing Surveys , volume=

How to approach ambiguous queries in conversational search: A survey of techniques, approaches, tools, and challenges , author=. ACM Computing Surveys , volume=. 2022 , publisher=

2022
[25]

Bert: Pre-training of deep bidirectional transformers for language understanding , author=. Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers) , pages=

2019
[26]

Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval , pages=

Query reformulation in e-commerce search , author=. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval , pages=
[27]

ACM Transactions on Information Systems (TOIS) , volume=

Modeling reformulation using query distributions , author=. ACM Transactions on Information Systems (TOIS) , volume=. 2013 , publisher=

2013
[28]

Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining , pages=

REFLEX: Reinforcement Feedback Learning with Large Language Models for E-commerce Query Expansion , author=. Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining , pages=
[29]

1994 , publisher=

Okapi at TREC , author=. 1994 , publisher=

1994
[30]

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management , pages=

Enhancing relevance of embedding-based retrieval at walmart , author=. Proceedings of the 33rd ACM International Conference on Information and Knowledge Management , pages=
[31]

Lost in the Middle: How Language Models Use Long Contexts

Lost in the middle: How language models use long contexts, 2023 , author=. URL https://arxiv. org/abs/2307.03172 , volume=

work page internal anchor Pith review Pith/arXiv arXiv 2023
[32]

Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=

Embedding-based retrieval in facebook search , author=. Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=
[33]

Proceedings of the 22nd ACM international conference on Information & Knowledge Management , pages=

Learning deep structured semantic models for web search using clickthrough data , author=. Proceedings of the 22nd ACM international conference on Information & Knowledge Management , pages=
[34]

Proceedings of the 13th ACM conference on recommender systems , pages=

Sampling-bias-corrected neural modeling for large corpus item recommendations , author=. Proceedings of the 13th ACM conference on recommender systems , pages=
[35]

Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval , pages=

Colbert: Efficient and effective passage search via contextualized late interaction over bert , author=. Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval , pages=
[36]

MS MARCO: A Human Generated MAchine Reading COmprehension Dataset

MS MARCO: A human generated machine reading comprehension dataset , author=. arXiv preprint arXiv:1611.09268 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[37]

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval , pages=

Personalizing search via automated analysis of interests and activities , author=. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval , pages=
[38]

The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05) , pages=

Personalized search based on user search histories , author=. The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05) , pages=. 2005 , organization=

2005
[39]

Computer , volume=

Matrix factorization techniques for recommender systems , author=. Computer , volume=. 2009 , publisher=

2009
[40]

2010 IEEE International conference on data mining , pages=

Factorization machines , author=. 2010 IEEE International conference on data mining , pages=. 2010 , organization=

2010
[41]

Proceedings of the ACM Web Conference 2024 , pages=

Knowledge-augmented large language models for personalized contextual query suggestion , author=. Proceedings of the ACM Web Conference 2024 , pages=

2024
[42]

Acm Computing Surveys (CSUR) , volume=

A survey of automatic query expansion in information retrieval , author=. Acm Computing Surveys (CSUR) , volume=. 2012 , publisher=

2012
[43]

Proceedings of the 2009 Workshop on Web Search Click Data , pages=

Survey and evaluation of query intent detection methods , author=. Proceedings of the 2009 Workshop on Web Search Click Data , pages=

2009
[44]

Proceedings of the 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023) , pages=

Representation learning for hierarchical classification of entity titles , author=. Proceedings of the 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023) , pages=

2023
[45]

Proceedings of the 14th international conference on World Wide Web , pages=

Automatic identification of user goals in web search , author=. Proceedings of the 14th international conference on World Wide Web , pages=
[46]

arXiv preprint arXiv:2107.08291 , year=

Neural search: Learning query and product representations in fashion e-commerce , author=. arXiv preprint arXiv:2107.08291 , year=

work page arXiv
[47]

International symposium on string processing and information retrieval , pages=

The intention behind web queries , author=. International symposium on string processing and information retrieval , pages=. 2006 , organization=

2006
[48]

arXiv preprint arXiv:2008.07559 , year=

Resolving intent ambiguities by retrieving discriminative clarifying questions , author=. arXiv preprint arXiv:2008.07559 , year=

work page arXiv 2008

[1] [1]

Companion of the 2024 International Conference on Management of Data , pages =

Yu, Changlong and Liu, Xin and Maia, Jefferson and Li, Yang and Cao, Tianyu and Gao, Yifan and Song, Yangqiu and Goutam, Rahul and Zhang, Haiyang and Yin, Bing and Li, Zheng , title =. Companion of the 2024 International Conference on Management of Data , pages =. 2024 , isbn =. doi:10.1145/3626246.3653398 , abstract =

work page doi:10.1145/3626246.3653398 2024

[2] [2]

FABRIC : Fully-Automated Broad Intent Categorization in E -commerce

Tigunova, Anna and Schmidt, Philipp and Akcora, Damla Ezgi. FABRIC : Fully-Automated Broad Intent Categorization in E -commerce. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track. 2025. doi:10.18653/v1/2025.emnlp-industry.29

work page doi:10.18653/v1/2025.emnlp-industry.29 2025

[3] [3]

Generative Models for Product Attribute Extraction

Blume, Ansel and Zalmout, Nasser and Ji, Heng and Li, Xian. Generative Models for Product Attribute Extraction. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track. 2023. doi:10.18653/v1/2023.emnlp-industry.55

work page doi:10.18653/v1/2023.emnlp-industry.55 2023

[4] [4]

Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , pages =

Luo, Xusheng and Bo, Le and Wu, Jinhang and Li, Lin and Luo, Zhiy and Yang, Yonghua and Yang, Keping , title =. Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , pages =. 2021 , isbn =. doi:10.1145/3447548.3467203 , abstract =

work page doi:10.1145/3447548.3467203 2021

[5] [5]

2025 , eprint=

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory , author=. 2025 , eprint=

2025

[6] [6]

ArXiv , year=

PersonaMem-v2: Towards Personalized Intelligence via Learning Implicit User Personas and Agentic Memory , author=. ArXiv , year=

[7] [7]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages =

Luo, Chen and Goutam, Rahul and Zhang, Haiyang and Zhang, Chao and Song, Yangqiu and Yin, Bing , title =. Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages =. 2023 , isbn =. doi:10.1145/3539618.3591858 , abstract =

work page doi:10.1145/3539618.3591858 2023

[8] [8]

Explicit Attribute Extraction in e-Commerce Search

Loughnane, Robyn and Liu, Jiaxin and Chen, Zhilin and Wang, Zhiqi and Giroux, Joseph and Du, Tianchuan and Schroeder, Benjamin and Sun, Weiyi. Explicit Attribute Extraction in e-Commerce Search. Proceedings of the Seventh Workshop on e-Commerce and NLP @ LREC-COLING 2024. 2024

2024

[9] [9]

Personalizing Dialogue Agents: I have a dog, do you have pets too?

Zhang, Saizheng and Dinan, Emily and Urbanek, Jack and Szlam, Arthur and Kiela, Douwe and Weston, Jason. Personalizing Dialogue Agents: I have a dog, do you have pets too?. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2018. doi:10.18653/v1/P18-1205

work page doi:10.18653/v1/p18-1205 2018

[10] [10]

Bruce , title =

Ai, Qingyao and Zhang, Yongfeng and Bi, Keping and Chen, Xu and Croft, W. Bruce , title =. Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages =. 2017 , isbn =. doi:10.1145/3077136.3080813 , abstract =

work page doi:10.1145/3077136.3080813 2017

[11] [11]

MIND : A Large-scale Dataset for News Recommendation

Wu, Fangzhao and Qiao, Ying and Chen, Jiun-Hung and Wu, Chuhan and Qi, Tao and Lian, Jianxun and Liu, Danyang and Xie, Xing and Gao, Jianfeng and Wu, Winnie and Zhou, Ming. MIND : A Large-scale Dataset for News Recommendation. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020. doi:10.18653/v1/2020.acl-main.331

work page doi:10.18653/v1/2020.acl-main.331 2020

[12] [12]

Proceedings of the 7th ACM International Conference on Web Search and Data Mining , pages =

Serdyukov, Pavel and Dupret, Georges and Craswell, Nick , title =. Proceedings of the 7th ACM International Conference on Web Search and Data Mining , pages =. 2014 , isbn =. doi:10.1145/2556195.2556207 , abstract =

work page doi:10.1145/2556195.2556207 2014

[13] [13]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '23), July 23--27, 2023, Taipei, Taiwan , series =

Bernard, Nolwenn and Balog, Krisztian , title =. Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '23), July 23--27, 2023, Taipei, Taiwan , series =

2023

[14] [14]

P ersona L ens: A Benchmark for Personalization Evaluation in Conversational AI Assistants

Zhao, Zheng and Vania, Clara and Kayal, Subhradeep and Khan, Naila and Cohen, Shay B and Yilmaz, Emine. P ersona L ens: A Benchmark for Personalization Evaluation in Conversational AI Assistants. Findings of the Association for Computational Linguistics: ACL 2025. 2025. doi:10.18653/v1/2025.findings-acl.927

work page doi:10.18653/v1/2025.findings-acl.927 2025

[15] [15]

and Radlinski, Filip and White, Ryen W

Bennett, Paul N. and Radlinski, Filip and White, Ryen W. and Yilmaz, Emine , title =. Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages =. 2011 , isbn =. doi:10.1145/2009916.2009938 , abstract =

work page doi:10.1145/2009916.2009938 2011

[16] [16]

Companion Proceedings of the ACM Web Conference 2023 , pages=

Knowledge Graph-Enhanced Neural Query Rewriting , author=. Companion Proceedings of the ACM Web Conference 2023 , pages=. 2023 , url=

2023

[17] [17]

Yetukuri, Jayanth and Khan, Ishita , title =. Proceedings of the ACM SIGIR Workshop on eCommerce 2025 co-located with the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025) , year =

2025

[18] [18]

Proceedings of the 2021 Conference on Human Information Interaction and Retrieval , pages=

Dataset of Natural Language Queries for E-Commerce , author=. Proceedings of the 2021 Conference on Human Information Interaction and Retrieval , pages=. 2021 , url=

2021

[19] [19]

A Chain-of-Thought Approach to Semantic Query Categorization in e-Commerce Taxonomies , author=. Proceedings of the ACM SIGIR Workshop on eCommerce 2025 co-located with the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025) , year=

2025

[20] [20]

Distributed Word Representations Improve NER for e-Commerce

Joshi, Mahesh and Hart, Ethan and Vogel, Mirko and Ruvini, Jean-David. Distributed Word Representations Improve NER for e-Commerce. Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing. 2015. doi:10.3115/v1/W15-1522

work page doi:10.3115/v1/w15-1522 2015

[21] [21]

Q uery NER : Segmentation of E -commerce Queries

Palen-Michel, Chester and Liang, Lizzie and Wu, Zhe and Lignos, Constantine. Q uery NER : Segmentation of E -commerce Queries. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). 2024

2024

[22] [22]

E -Commerce Product Categorization with LLM -based Dual-Expert Classification Paradigm

Cheng, Zhu and Zhang, Wen and Chou, Chih-Chi and Jau, You-Yi and Pathak, Archita and Gao, Peng and Batur, Umit. E -Commerce Product Categorization with LLM -based Dual-Expert Classification Paradigm. Proceedings of the 1st Workshop on Customizable NLP: Progress and Challenges in Customizing NLP for a Domain, Application, Group, or Individual (CustomNLP4U)...

work page doi:10.18653/v1/2024.customnlp4u-1.22 2024

[23] [23]

2020 , publisher=

Query understanding for search engines , author=. 2020 , publisher=

2020

[24] [24]

ACM Computing Surveys , volume=

How to approach ambiguous queries in conversational search: A survey of techniques, approaches, tools, and challenges , author=. ACM Computing Surveys , volume=. 2022 , publisher=

2022

[25] [25]

Bert: Pre-training of deep bidirectional transformers for language understanding , author=. Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers) , pages=

2019

[26] [26]

Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval , pages=

Query reformulation in e-commerce search , author=. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval , pages=

[27] [27]

ACM Transactions on Information Systems (TOIS) , volume=

Modeling reformulation using query distributions , author=. ACM Transactions on Information Systems (TOIS) , volume=. 2013 , publisher=

2013

[28] [28]

Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining , pages=

REFLEX: Reinforcement Feedback Learning with Large Language Models for E-commerce Query Expansion , author=. Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining , pages=

[29] [29]

1994 , publisher=

Okapi at TREC , author=. 1994 , publisher=

1994

[30] [30]

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management , pages=

Enhancing relevance of embedding-based retrieval at walmart , author=. Proceedings of the 33rd ACM International Conference on Information and Knowledge Management , pages=

[31] [31]

Lost in the Middle: How Language Models Use Long Contexts

Lost in the middle: How language models use long contexts, 2023 , author=. URL https://arxiv. org/abs/2307.03172 , volume=

work page internal anchor Pith review Pith/arXiv arXiv 2023

[32] [32]

Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=

Embedding-based retrieval in facebook search , author=. Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=

[33] [33]

Proceedings of the 22nd ACM international conference on Information & Knowledge Management , pages=

Learning deep structured semantic models for web search using clickthrough data , author=. Proceedings of the 22nd ACM international conference on Information & Knowledge Management , pages=

[34] [34]

Proceedings of the 13th ACM conference on recommender systems , pages=

Sampling-bias-corrected neural modeling for large corpus item recommendations , author=. Proceedings of the 13th ACM conference on recommender systems , pages=

[35] [35]

Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval , pages=

Colbert: Efficient and effective passage search via contextualized late interaction over bert , author=. Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval , pages=

[36] [36]

MS MARCO: A Human Generated MAchine Reading COmprehension Dataset

MS MARCO: A human generated machine reading comprehension dataset , author=. arXiv preprint arXiv:1611.09268 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[37] [37]

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval , pages=

Personalizing search via automated analysis of interests and activities , author=. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval , pages=

[38] [38]

The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05) , pages=

Personalized search based on user search histories , author=. The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05) , pages=. 2005 , organization=

2005

[39] [39]

Computer , volume=

Matrix factorization techniques for recommender systems , author=. Computer , volume=. 2009 , publisher=

2009

[40] [40]

2010 IEEE International conference on data mining , pages=

Factorization machines , author=. 2010 IEEE International conference on data mining , pages=. 2010 , organization=

2010

[41] [41]

Proceedings of the ACM Web Conference 2024 , pages=

Knowledge-augmented large language models for personalized contextual query suggestion , author=. Proceedings of the ACM Web Conference 2024 , pages=

2024

[42] [42]

Acm Computing Surveys (CSUR) , volume=

A survey of automatic query expansion in information retrieval , author=. Acm Computing Surveys (CSUR) , volume=. 2012 , publisher=

2012

[43] [43]

Proceedings of the 2009 Workshop on Web Search Click Data , pages=

Survey and evaluation of query intent detection methods , author=. Proceedings of the 2009 Workshop on Web Search Click Data , pages=

2009

[44] [44]

Proceedings of the 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023) , pages=

Representation learning for hierarchical classification of entity titles , author=. Proceedings of the 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023) , pages=

2023

[45] [45]

Proceedings of the 14th international conference on World Wide Web , pages=

Automatic identification of user goals in web search , author=. Proceedings of the 14th international conference on World Wide Web , pages=

[46] [46]

arXiv preprint arXiv:2107.08291 , year=

Neural search: Learning query and product representations in fashion e-commerce , author=. arXiv preprint arXiv:2107.08291 , year=

work page arXiv

[47] [47]

International symposium on string processing and information retrieval , pages=

The intention behind web queries , author=. International symposium on string processing and information retrieval , pages=. 2006 , organization=

2006

[48] [48]

arXiv preprint arXiv:2008.07559 , year=

Resolving intent ambiguities by retrieving discriminative clarifying questions , author=. arXiv preprint arXiv:2008.07559 , year=

work page arXiv 2008