Aligning Large Language Models with Implicit Preferences from User-Generated Content

Tan, Zhaoxuan, Li, Zheng, Liu, Tianyi, Wang, Haodong, Yun, Hyokun, Zeng, Ming · 2025 · DOI 10.18653/v1/2025.acl-long.384

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

On the Limits of Steering Vectors for Preference-Aligned Generation

cs.CL · 2026-07-02 · unverdicted · novelty 6.0

Empirical evaluation on the PLUME benchmark shows steering vectors vary widely in trait expressibility, degrade on task transfer, and lose effectiveness when multiple vectors are composed.

citing papers explorer

Showing 1 of 1 citing paper.

On the Limits of Steering Vectors for Preference-Aligned Generation cs.CL · 2026-07-02 · unverdicted · none · ref 20
Empirical evaluation on the PLUME benchmark shows steering vectors vary widely in trait expressibility, degrade on task transfer, and lose effectiveness when multiple vectors are composed.

Aligning Large Language Models with Implicit Preferences from User-Generated Content

fields

years

verdicts

representative citing papers

citing papers explorer