A survey of multimodal hallucination evaluation and detection

Hao Liu et al · 2025 · arXiv 2507.19024

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Once Correct, Still Wrong: Counterfactual Hallucination in Multilingual Vision-Language Models

cs.CL · 2026-02-05 · unverdicted · novelty 7.0

VLMs exhibit sharply higher counterfactual hallucination rates in Arabic and dialects despite high true-statement accuracy, revealed by the new M²CQA benchmark and CFHR metric.

TASTE: A Designer-Annotated Multi-Dimensional Preference Dataset for AI-Generated Graphic Design

cs.CV · 2026-05-20

citing papers explorer

Showing 2 of 2 citing papers.

Once Correct, Still Wrong: Counterfactual Hallucination in Multilingual Vision-Language Models cs.CL · 2026-02-05 · unverdicted · none · ref 2
VLMs exhibit sharply higher counterfactual hallucination rates in Arabic and dialects despite high true-statement accuracy, revealed by the new M²CQA benchmark and CFHR metric.
TASTE: A Designer-Annotated Multi-Dimensional Preference Dataset for AI-Generated Graphic Design cs.CV · 2026-05-20 · unreviewed · ref 26

A survey of multimodal hallucination evaluation and detection

fields

years

verdicts

representative citing papers

citing papers explorer