Why Thinking Hurts: Diagnosing and Rectifying Linguistic Inertia in Large Language Models for Recommendation

Enhong Chen; Hang Lv; Hao Wang; Luankang Zhang; Mingjia Yin; Wei Guo; Xuyang Zhi; Yonghao Huang; Yuyang Ye

arxiv: 2602.16587 · v2 · pith:FITETHI2new · submitted 2026-02-18 · 💻 cs.IR

Why Thinking Hurts: Diagnosing and Rectifying Linguistic Inertia in Large Language Models for Recommendation

Luankang Zhang , Yonghao Huang , Hang Lv , Xuyang Zhi , Mingjia Yin , Yuyang Ye , Wei Guo , Hao Wang

show 1 more author

Enhong Chen

This is my paper

classification 💻 cs.IR

keywords modelsbeforehistoricalinertialicdlinguisticreasoningrecommendation

0 comments

read the original abstract

Chain-of-Thought (CoT) reasoning is widely used to improve LLM performance, and recent foundation recommender models adopt it by generating textual reasoning before predicting target items represented by Semantic IDs (SIDs). However, we observe that enabling thinking mode in models such as OpenOneRec can degrade recommendation quality by up to 25%. We investigate this failure and identify Linguistic Inertia: when a textual CoT segment is inserted before SID generation, the model relies more on natural-language context and less on historical SID evidence. Further analyses show that this effect is amplified by reduced access to historical information and longer CoT lengths. To mitigate it, we propose Linguistic-Inertia-Calibrated Decoding (LICD), a training-free framework that combines Reasoning-Chain Compression and Bias-Subtracted Contrastive Inference. Experiments on three large-scale benchmarks show that LICD consistently outperforms both no-thinking and original-thinking baselines. Our code is available at https://anonymous.4open.science/r/LICD-4573.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

IE as Cache: Information Extraction Enhanced Agentic Reasoning
cs.CL 2026-04 unverdicted novelty 7.0

IE-as-Cache framework repurposes information extraction as a dynamic cognitive cache to improve agentic reasoning accuracy in LLMs on challenging benchmarks.
SAPO: Step-Aligned Policy Optimization for Reasoning-Based Generative Recommendation
cs.AI 2026-05 unverdicted novelty 6.0

SAPO computes per-reasoning-step group-relative advantages in RL to improve credit assignment for structured generation of semantic identifiers in recommendation systems.
Rethinking the Necessity of Adaptive Retrieval-Augmented Generation through the Lens of Adaptive Listwise Ranking
cs.IR 2026-04 unverdicted novelty 5.0

AdaRankLLM shows adaptive listwise reranking outperforms fixed-depth retrieval for most LLMs by acting as a noise filter for weak models and an efficiency optimizer for strong ones, with lower context use.