arXiv preprint arXiv:2507.09184 , url=

MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models , author= · arXiv 2507.09184

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

VIDA: A dataset for Visually Dependent Ambiguity in Multimodal Machine Translation

cs.CL · 2026-05-03 · unverdicted · novelty 7.0

VIDA provides 2,500 visually-dependent ambiguous translation examples and span-level disambiguation metrics; CoT-SFT on LVLMs improves out-of-distribution performance over standard SFT.

citing papers explorer

Showing 1 of 1 citing paper.

VIDA: A dataset for Visually Dependent Ambiguity in Multimodal Machine Translation cs.CL · 2026-05-03 · unverdicted · none · ref 51
VIDA provides 2,500 visually-dependent ambiguous translation examples and span-level disambiguation metrics; CoT-SFT on LVLMs improves out-of-distribution performance over standard SFT.

arXiv preprint arXiv:2507.09184 , url=

fields

years

verdicts

representative citing papers

citing papers explorer