DART adapts VLA models to environmental shifts with one demonstration using subspace-aligned weight vector arithmetic.
Action Hallucination in Generative Vision-Language-Action Models
2 Pith papers cite this work. Polarity classification is still indexing.
abstract
Robot Foundation Models, such as VLAs, promise end-to-end generative robot policies with broad generalization. Yet it remains unclear whether they fundamentally resolve the core problem of action generation in embodied settings, or overcome the long-standing challenges of robotics. We address this question by analyzing action hallucinations that violate physical constraints and their extension to plan-level failures. Focusing on latent-variable generative policies, we show that hallucinations can arise from structural mismatches between feasible robot behavior and common model architectures. We study three such barriers -- topological, precision, and horizon -- and show how they impose unavoidable tradeoffs. Our analysis provides mechanistic explanations for reported empirical failures of generative robot policies and suggests principled directions for improving reliability and trustworthiness, without abandoning their expressive power.
fields
cs.RO 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
A literature review that defines silent physical-action failures in Physical AI and identifies the lack of complete runtime authorization boundaries across surveyed technical streams.
citing papers explorer
-
Domain Arithmetic: One-Shot VLA Adaptation under Environmental Shifts
DART adapts VLA models to environmental shifts with one demonstration using subspace-aligned weight vector arithmetic.
-
Silent Failures in Physical AI: A Literature Review of Runtime Action Authorization for Autonomous Systems
A literature review that defines silent physical-action failures in Physical AI and identifies the lack of complete runtime authorization boundaries across surveyed technical streams.