Title resolution pending

Direct preference optimization: Your language model is secretly a reward model · 2024 · arXiv 2509.25177

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

YARD: Y-Architecture Register Decoding for Efficient Hallucination Mitigation in Large Vision-Language Models

cs.CV · 2026-05-29 · unverdicted · novelty 7.0

YARD is a training-free method using Y-shaped decoder architecture and register tokens to improve contrastive decoding for hallucination reduction in LVLMs with lower latency.

ADAPT: Attention Dynamics Alignment with Preference Tuning for Faithful MLLMs

cs.CV · 2026-06-30 · unverdicted · novelty 6.0

ADAPT reduces MLLM hallucinations 40-60% by aligning cross-attention dynamics via visual anchors, supervised inference, and preference tuning while preserving general capabilities.

MLLMs Get It Right, Then Get It Wrong: Tracing and Correcting Late-Layer Textual Bias

cs.CV · 2026-06-16 · unverdicted · novelty 6.0

MLLMs show late-layer textual override of correct visual predictions, with a directional signature enabling a simple inference-time recovery method that improves conflict benchmarks by up to 9.4%.

Reliability-Prioritized Fine-Grained Generation in Multimodal Large

cs.CV · 2026-06-28 · unverdicted · novelty 5.0 · 2 refs

Proposes GranFact benchmark with coarse-to-fine annotations and a DPO variant that penalizes unreliable fine-grained claims to improve reliable specificity in MLLM outputs.

Dismantling Pathological Shortcuts: A Causal Framework for Faithful LVLM Decoding

cs.CV · 2026-06-25 · unverdicted · novelty 5.0

Fox detects risky attention heads in LVLMs using visual attention entropy and severs hallucination shortcuts via numerical logit saturation and conflict-gated decoding, outperforming prior methods by 29.1%.

citing papers explorer

Showing 5 of 5 citing papers after filters.

YARD: Y-Architecture Register Decoding for Efficient Hallucination Mitigation in Large Vision-Language Models cs.CV · 2026-05-29 · unverdicted · none · ref 42
YARD is a training-free method using Y-shaped decoder architecture and register tokens to improve contrastive decoding for hallucination reduction in LVLMs with lower latency.
ADAPT: Attention Dynamics Alignment with Preference Tuning for Faithful MLLMs cs.CV · 2026-06-30 · unverdicted · none · ref 29
ADAPT reduces MLLM hallucinations 40-60% by aligning cross-attention dynamics via visual anchors, supervised inference, and preference tuning while preserving general capabilities.
MLLMs Get It Right, Then Get It Wrong: Tracing and Correcting Late-Layer Textual Bias cs.CV · 2026-06-16 · unverdicted · none · ref 29
MLLMs show late-layer textual override of correct visual predictions, with a directional signature enabling a simple inference-time recovery method that improves conflict benchmarks by up to 9.4%.
Reliability-Prioritized Fine-Grained Generation in Multimodal Large cs.CV · 2026-06-28 · unverdicted · none · ref 3 · 2 links
Proposes GranFact benchmark with coarse-to-fine annotations and a DPO variant that penalizes unreliable fine-grained claims to improve reliable specificity in MLLM outputs.
Dismantling Pathological Shortcuts: A Causal Framework for Faithful LVLM Decoding cs.CV · 2026-06-25 · unverdicted · none · ref 78
Fox detects risky attention heads in LVLMs using visual attention entropy and severs hallucination shortcuts via numerical logit saturation and conflict-gated decoding, outperforming prior methods by 29.1%.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer