Bring me a snack
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.AI (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
Learning Transferable Latent User Preferences for Human-Aligned Decision Making
CLIPR learns transferable natural-language rules that capture latent user preferences from minimal conversational input, improving LLM alignment in decision making. It outperforms prior methods on three datasets and in a user study.