The Geometry of Representational Failures in Vision Language Models

Alan Perotti; André Panisson; Daniele Savietto; Declan Campbell; Giovanni Petri; Jonathan D. Cohen; Marco Nurisso

arxiv: 2602.07025 · v2 · pith:UIGNSIZ4new · submitted 2026-02-02 · 💻 cs.CV · cs.AI

Daniele Savietto, Declan Campbell, André Panisson, Marco Nurisso, Giovanni Petri, Jonathan D. Cohen, Alan Perotti

classification 💻 cs.CV cs.AI

keywords failures, model, vectors, visual, behavior, concept, geometry, internal

Vision-Language Models (VLMs) exhibit puzzling failures in multi-object visual tasks, such as hallucinating non-existent elements or failing to identify the most similar objects among distractors. While these errors mirror human cognitive constraints, such as the "Binding Problem", the internal mechanisms driving them in artificial systems remain poorly understood. Here, we offer mechanistic insight by analyzing the representational geometry of open-weight VLMs (Qwen, InternVL, Gemma) and comparing methodologies for distilling "concept vectors", latent directions that encode visual concepts. We validate our concept vectors via steering interventions that reliably manipulate model behavior in both simplified and naturalistic vision tasks (e.g., forcing the model to perceive a red flower as blue). We observe that the geometric overlap between these vectors strongly correlates with specific error patterns, offering a grounded quantitative framework for understanding how internal representations shape model behavior and drive visual failures.
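
To make the terms in the abstract concrete, the following is a minimal sketch of one common way to build a concept vector (a difference of class means), measure geometric overlap (cosine similarity), and steer a hidden state. It is an illustration under stated assumptions, not the authors' method: the abstract says several distillation methodologies are compared but does not name them, so difference-of-means is a stand-in. The activations are synthetic 4-dimensional vectors, and every function name (`concept_vector`, `cosine_overlap`, `steer`, `projection`) is hypothetical.

```python
import math
import random


def mean_vector(rows):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def concept_vector(with_concept, without_concept):
    """Difference of class means (an assumed, simple distillation method)."""
    mu_pos = mean_vector(with_concept)
    mu_neg = mean_vector(without_concept)
    return [p - q for p, q in zip(mu_pos, mu_neg)]


def norm(u):
    """Euclidean length of a vector."""
    return math.sqrt(sum(a * a for a in u))


def cosine_overlap(u, v):
    """Cosine similarity, used here as a proxy for 'geometric overlap'."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (norm(u) * norm(v))


def projection(hidden, direction):
    """Scalar projection of a hidden state onto a concept direction."""
    dot = sum(h * d for h, d in zip(hidden, direction))
    return dot / norm(direction)


def steer(hidden, direction, alpha):
    """Additive steering: shift a hidden state along a direction."""
    return [h + alpha * d for h, d in zip(hidden, direction)]


# Synthetic stand-ins for model activations (assumption: no real VLM is used).
rng = random.Random(0)


def sample(center, n=50, noise=0.1):
    """Draw n noisy activation vectors around a fixed center."""
    return [[c + rng.gauss(0.0, noise) for c in center] for _ in range(n)]


neutral = sample([0.0, 0.0, 0.0, 0.0])
red = sample([1.0, 0.0, 0.0, 0.0])
blue = sample([0.0, 1.0, 0.0, 0.0])

v_red = concept_vector(red, neutral)
v_blue = concept_vector(blue, neutral)

# Overlap between the two concept directions.
overlap = cosine_overlap(v_red, v_blue)

# Steer one "red" activation toward "blue": remove red, then add blue.
original = red[0]
steered = steer(steer(original, v_red, -1.0), v_blue, 1.0)

print("overlap(red, blue):", round(overlap, 3))
print("blue projection before:", round(projection(original, v_blue), 3))
print("blue projection after: ", round(projection(steered, v_blue), 3))
```

In this toy setup the two concepts occupy nearly orthogonal axes, so their overlap is close to zero. The abstract's claim concerns the opposite regime, where concept directions overlap substantially and that overlap correlates with specific error patterns.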

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Probing for Representation Manifolds in Superposition
cs.LG 2026-05 unverdicted novelty 5.0

Introduces the Manifold Probe to discover representation manifolds in superposition and demonstrates causal steering on time concepts in Llama 2-7b.