Disentangling length bias in preference learning via response-conditioned modeling

Jianfeng Cai, Jinhua Zhu, Ruopei Sun, Yue Wang, Li Li, Wengang Zhou, Houqiang Li · 2025 · arXiv 2502.00814

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

When Prompts Override Vision: Prompt-Induced Hallucinations in LVLMs

cs.CV · 2026-04-23 · unverdicted · novelty 6.0

Hallucinations in LVLMs largely arise from textual priors in prompts, and can be reduced by fine-tuning with preference optimization on grounded vs. hallucinated response pairs.

Exploring the Secondary Risks of Large Language Models

cs.LG · 2025-06-14 · unverdicted · novelty 6.0

Introduces secondary risks as a new class of LLM failures from benign prompts, defines two primitives, proposes SecLens search framework, and releases SecRiskBench showing risks are widespread across 16 models.

Rethinking the Comparison Unit in Sequence-Level Reinforcement Learning: An Equal-Length Paired Training Framework from Loss Correction to Sample Construction

cs.LG · 2026-04-19 · unverdicted · novelty 5.0

EqLen reframes length bias in sequence-level RL as a comparison-unit construction problem and builds equal-length training segments via dual-track generation, prefix inheritance, and segment masking.

citing papers explorer

Showing 3 of 3 citing papers.

When Prompts Override Vision: Prompt-Induced Hallucinations in LVLMs cs.CV · 2026-04-23 · unverdicted · none · ref 4
Hallucinations in LVLMs largely arise from textual priors in prompts, and can be reduced by fine-tuning with preference optimization on grounded vs. hallucinated response pairs.
Exploring the Secondary Risks of Large Language Models cs.LG · 2025-06-14 · unverdicted · none · ref 6
Introduces secondary risks as a new class of LLM failures from benign prompts, defines two primitives, proposes SecLens search framework, and releases SecRiskBench showing risks are widespread across 16 models.
Rethinking the Comparison Unit in Sequence-Level Reinforcement Learning: An Equal-Length Paired Training Framework from Loss Correction to Sample Construction cs.LG · 2026-04-19 · unverdicted · none · ref 17
EqLen reframes length bias in sequence-level RL as a comparison-unit construction problem and builds equal-length training segments via dual-track generation, prefix inheritance, and segment masking.

Disentangling length bias in preference learning via response-conditioned modeling

fields

years

verdicts

representative citing papers

citing papers explorer