arXiv preprint arXiv:2505.13441 (2025) 5, 6, 23, 24

Deshpande, A · 2025 · arXiv 2505.13441

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Why Far Looks Up: Probing Spatial Representation in Vision-Language Models

cs.CV · 2026-05-28 · conditional · novelty 7.0

VLMs exhibit consistent vertical-distance entanglement in embeddings from perspective bias in natural images, producing accuracy gaps that a new synthetic benchmark SpatialTunnel exposes as model-intrinsic.

citing papers explorer

Showing 1 of 1 citing paper.

Why Far Looks Up: Probing Spatial Representation in Vision-Language Models cs.CV · 2026-05-28 · conditional · none · ref 19
VLMs exhibit consistent vertical-distance entanglement in embeddings from perspective bias in natural images, producing accuracy gaps that a new synthetic benchmark SpatialTunnel exposes as model-intrinsic.

arXiv preprint arXiv:2505.13441 (2025) 5, 6, 23, 24

fields

years

verdicts

representative citing papers

citing papers explorer