Segment anything

· 2023

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

VoxAfford: Multi-Scale Voxel-Token Fusion for Open-Vocabulary 3D Affordance Detection

cs.CV · 2026-05-02 · unverdicted · novelty 7.0

VoxAfford fuses multi-scale voxel features into MLLM output tokens using cross-attention with a learned compatibility gate to achieve SOTA open-vocabulary 3D affordance detection with ~8% mIoU gain and zero-shot robot transfer.

GAP: Geometric Anchor Pre-training for Data-Efficient Visuomotor Learning of Manipulation Tasks

cs.RO · 2026-05-15 · unverdicted · novelty 6.0

GAP pre-trains the spatial adapter on a lightweight simulated proxy task with free object masks to generate repeatable geometric keypoints, yielding higher success rates than baselines in low-data robotic manipulation on RoboMimic and ManiSkill.

citing papers explorer

Showing 2 of 2 citing papers.

VoxAfford: Multi-Scale Voxel-Token Fusion for Open-Vocabulary 3D Affordance Detection cs.CV · 2026-05-02 · unverdicted · none · ref 52
VoxAfford fuses multi-scale voxel features into MLLM output tokens using cross-attention with a learned compatibility gate to achieve SOTA open-vocabulary 3D affordance detection with ~8% mIoU gain and zero-shot robot transfer.
GAP: Geometric Anchor Pre-training for Data-Efficient Visuomotor Learning of Manipulation Tasks cs.RO · 2026-05-15 · unverdicted · none · ref 24
GAP pre-trains the spatial adapter on a lightweight simulated proxy task with free object masks to generate repeatable geometric keypoints, yielding higher success rates than baselines in low-data robotic manipulation on RoboMimic and ManiSkill.

Segment anything

fields

years

verdicts

representative citing papers

citing papers explorer