Dexgraspvla: A vision-language-action framework towards general dexterous grasping

Yifan Zhong, Xuchuan Huang, Ruochong Li, Ceyao Zhang, Zhang Chen, Tianrui Guan + 2 more · 2026 · Proceedings of the AAAI Conference on Artificial Intelligence · DOI 10.1609/aaai.v40i22.38953

2 Pith papers cite this work, alongside 3 external citations. Polarity classification is still indexing.

2 Pith papers citing it

3 external citations · Crossref

open at publisher browse 2 citing papers

representative citing papers

GEAR-VLA: Learning Geometry-Aware Action Representations for Generalizable Robotic Manipulation

cs.RO · 2026-06-07 · unverdicted · novelty 6.0

GEAR-VLA learns geometry-aware action representations via coarse-to-fine pretraining, gradient-decoupled DiT action expert, semantic-aligned 3D integration, and embodiment canonicalization, reporting SOTA results on LIBERO benchmarks and over 80% success on unseen embodiments and 212 unseen objects.

Hand-in-the-Loop: Improving VLA Policies for Dexterous Manipulation via Seamless Hand-Arm Intervention

cs.RO · 2026-05-14 · unverdicted · novelty 6.0 · 2 refs

HandITL enables seamless human intervention in VLA policies for bimanual dexterous manipulation, cutting jitter by 99.8% and improving refined policies by 19% over standard teleoperation.

citing papers explorer

Showing 2 of 2 citing papers.

GEAR-VLA: Learning Geometry-Aware Action Representations for Generalizable Robotic Manipulation cs.RO · 2026-06-07 · unverdicted · none · ref 13
GEAR-VLA learns geometry-aware action representations via coarse-to-fine pretraining, gradient-decoupled DiT action expert, semantic-aligned 3D integration, and embodiment canonicalization, reporting SOTA results on LIBERO benchmarks and over 80% success on unseen embodiments and 212 unseen objects.
Hand-in-the-Loop: Improving VLA Policies for Dexterous Manipulation via Seamless Hand-Arm Intervention cs.RO · 2026-05-14 · unverdicted · none · ref 33 · 2 links
HandITL enables seamless human intervention in VLA policies for bimanual dexterous manipulation, cutting jitter by 99.8% and improving refined policies by 19% over standard teleoperation.

Dexgraspvla: A vision-language-action framework towards general dexterous grasping

fields

years

verdicts

representative citing papers

citing papers explorer