Title resolution pending

Ruili Jiang, Kehai Chen, Xuefeng Bai, Zhixuan He, Juntao Li, Muyun Yang, Tiejun Zhao, Liqiang Nie, Min Zhang · 2025 · DOI 10.1145/3773279

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

As It Was: Aligning LLM Search Evaluation with Historical User Preferences

cs.IR · 2026-07-01 · unverdicted · novelty 7.0

Augmenting LLM search judges with historical QRI cards improves Spearman correlation with user preferences by ~5% overall (91% relative on disagreements) and 15% in multilingual settings, with better alignment to live A/B test outcomes.

What Do People Actually Want From AI? Mapping Preference Plurality

cs.CL · 2026-06-04 · unverdicted · novelty 6.0

Open-ended preference data reveals substantial plurality in what people want from AI and divergent interpretations of shared values such as truthfulness.

CRPO: Character-centric Group Relative Policy Optimization for Role-aware Reasoning in Role-playing Agents

cs.CL · 2026-05-25 · unverdicted · novelty 6.0

CRPO modifies GRPO with three mechanisms—decoupling task and style rewards, adapting constraints to character complexity, and using generic responses as negative baselines—to improve character fidelity in role-playing agents.

citing papers explorer

Showing 2 of 2 citing papers after filters.

What Do People Actually Want From AI? Mapping Preference Plurality cs.CL · 2026-06-04 · unverdicted · none · ref 39
Open-ended preference data reveals substantial plurality in what people want from AI and divergent interpretations of shared values such as truthfulness.
CRPO: Character-centric Group Relative Policy Optimization for Role-aware Reasoning in Role-playing Agents cs.CL · 2026-05-25 · unverdicted · none · ref 10
CRPO modifies GRPO with three mechanisms—decoupling task and style rewards, adapting constraints to character complexity, and using generic responses as negative baselines—to improve character fidelity in role-playing agents.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer