Title resolution pending

Chaeyoung Jung, Youngjoon Jang, Joon Son Chung · 2025 · arXiv 2505.20862

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Don't Let the Video Speak: Audio-Contrastive Preference Optimization for Audio-Visual Language Models

cs.CV · 2026-04-15 · unverdicted · novelty 7.0

Audio-Contrastive Preference Optimization (ACPO) mitigates audio hallucination in AVLMs via output-contrastive and input-contrastive objectives that enforce faithful audio grounding.

Temporal Contrastive Decoding: A Training-Free Method for Large Audio-Language Models

cs.SD · 2026-04-16 · unverdicted · novelty 6.0

Temporal Contrastive Decoding mitigates temporal smoothing bias in unified large audio-language models by contrasting logits from original and blurred audio inputs during decoding, yielding consistent gains on MMAU and AIR-Bench.

STEAR: Layer-Aware Spatiotemporal Evidence Intervention for Hallucination Mitigation in Video Large Language Models

cs.CV · 2026-04-03 · unverdicted · novelty 6.0

STEAR reduces spatial and temporal hallucinations in Video-LLMs via layer-aware evidence intervention from middle decoder layers in a single-encode pass.

Distorted or Fabricated? A Survey on Hallucination in Video LLMs

cs.CV · 2026-04-14 · unverdicted · novelty 5.0

The survey organizes hallucinations in Vid-LLMs into dynamic distortion and content fabrication, reviews evaluation benchmarks and mitigation methods, and traces root causes to weak temporal modeling and visual grounding.

citing papers explorer

Showing 4 of 4 citing papers.

Don't Let the Video Speak: Audio-Contrastive Preference Optimization for Audio-Visual Language Models cs.CV · 2026-04-15 · unverdicted · none · ref 21
Audio-Contrastive Preference Optimization (ACPO) mitigates audio hallucination in AVLMs via output-contrastive and input-contrastive objectives that enforce faithful audio grounding.
Temporal Contrastive Decoding: A Training-Free Method for Large Audio-Language Models cs.SD · 2026-04-16 · unverdicted · none · ref 2
Temporal Contrastive Decoding mitigates temporal smoothing bias in unified large audio-language models by contrasting logits from original and blurred audio inputs during decoding, yielding consistent gains on MMAU and AIR-Bench.
STEAR: Layer-Aware Spatiotemporal Evidence Intervention for Hallucination Mitigation in Video Large Language Models cs.CV · 2026-04-03 · unverdicted · none · ref 18
STEAR reduces spatial and temporal hallucinations in Video-LLMs via layer-aware evidence intervention from middle decoder layers in a single-encode pass.
Distorted or Fabricated? A Survey on Hallucination in Video LLMs cs.CV · 2026-04-14 · unverdicted · none · ref 2
The survey organizes hallucinations in Vid-LLMs into dynamic distortion and content fabrication, reviews evaluation benchmarks and mitigation methods, and traces root causes to weak temporal modeling and visual grounding.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer