Simvla: A simple vla baseline for robotic manipulation.arXiv preprint arXiv:2602.18224, 2026

· 2026 · arXiv 2602.18224

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

What Are We Actually Benchmarking in Robot Manipulation?

cs.RO · 2026-06-02 · conditional · novelty 6.0

LIBERO and CALVIN fail multiple proposed diagnostics for shortcut solvability, statistical significance, overfitting, and data dependence, while a tiny 0.09B probe reaches near-SOTA on LIBERO.

QuoVLA: Quotient Space for Vision-Language-Action Models

cs.CV · 2026-05-24 · unverdicted · novelty 5.0

QuoVLA introduces a quotient-space framework that compresses VLM latents into action-sufficient representations via quantization and dual-branch design for better VLA generalization.

Evo-Depth: A Lightweight Depth-Enhanced Vision-Language-Action Model

cs.CV · 2026-05-14 · unverdicted · novelty 4.0

Evo-Depth is a compact VLA model using a lightweight implicit depth encoder from RGB views plus progressive alignment to boost manipulation performance without added hardware.

citing papers explorer

Showing 2 of 2 citing papers after filters.

QuoVLA: Quotient Space for Vision-Language-Action Models cs.CV · 2026-05-24 · unverdicted · none · ref 20
QuoVLA introduces a quotient-space framework that compresses VLM latents into action-sufficient representations via quantization and dual-branch design for better VLA generalization.
Evo-Depth: A Lightweight Depth-Enhanced Vision-Language-Action Model cs.CV · 2026-05-14 · unverdicted · none · ref 29
Evo-Depth is a compact VLA model using a lightweight implicit depth encoder from RGB views plus progressive alignment to boost manipulation performance without added hardware.

Simvla: A simple vla baseline for robotic manipulation.arXiv preprint arXiv:2602.18224, 2026

fields

years

verdicts

representative citing papers

citing papers explorer