Canonical reference

VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models

80% of classified citations from Pith papers (4 of 5) cite this work as background.

8 Pith papers citing it
Background: 80% of classified citations

citation-role summary

background 4 · dataset 1

fields

cs.CV 7 · cs.SE 1

years

2026: 7 · 2024: 1

representative citing papers

When Vision Speaks for Sound

cs.CV · 2026-05-13 · unverdicted · novelty 6.0

Video MLLMs show an audio-visual Clever Hans effect relying on visual-acoustic correlations rather than audio verification; Thud interventions diagnose it and a 10K-sample preference alignment improves intervention performance by 28 points.

From Priors to Perception: Grounding Video-LLMs in Physical Reality

cs.CV · 2026-05-06 · unverdicted · novelty 6.0

Video-LLMs fail physical reasoning due to semantic prior dominance rather than perception deficits; a new programmatic adversarial curriculum and visual-anchored reasoning chain enable substantial gains via standard LoRA fine-tuning.

Video-ToC: Video Tree-of-Cue Reasoning

cs.CV · 2026-04-22 · unverdicted · novelty 6.0

Video-ToC adds tree-guided cue localization, demand-based RL rewards, and automated datasets to video LLMs, reporting better results than prior methods on six understanding benchmarks plus a hallucination test.
