pith. sign in

hub

Vision-language-action model with open-world embodied reasoning from pretrained knowledge

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

hub tools

citation-role summary

background 2

citation-polarity summary

fields

cs.RO 11 cs.CV 1

verdicts

UNVERDICTED 12

roles

background 2

polarities

background 2

clear filters

representative citing papers

Policy-based Foveated Imaging and Perception

cs.CV · 2026-06-01 · unverdicted · novelty 6.0

A task-aware policy learned via reinforcement learning allocates high-resolution pixels on dual-stream sensors in real time, outperforming fixed or non-predictive baselines under tight pixel budgets in both simulation and 200 MP hardware tests.

Continuous Reasoning for Vision-Language-Action

cs.RO · 2026-05-29 · unverdicted · novelty 6.0

Continuous Reasoning for VLA introduces a shared Gaussian latent for continuous thoughts, trained with self-verification to improve action prediction on LIBERO-PRO and real robots.

PhysBrain 1.0 Technical Report

cs.RO · 2026-05-14 · unverdicted · novelty 5.0

PhysBrain 1.0 extracts scene elements, spatial dynamics, actions and depth relations from human egocentric video to create QA supervision for VLMs, then transfers the resulting physical priors to VLA policies via capability-preserving adaptation.

citing papers explorer

Showing 2 of 2 citing papers after filters.