pith. sign in

Tapir: Tracking any point with per-frame initialization and temporal refinement

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

fields

cs.CV 2 cs.RO 2

years

2026 3 2023 1

verdicts

UNVERDICTED 4

clear filters

representative citing papers

Human Universal Grasping

cs.RO · 2026-06-15 · unverdicted · novelty 7.0

HUG trains a flow-matching model on a new 1M-frame egocentric human grasp dataset to generate retargetable grasps from single RGB-D images, beating baselines by 23-34% on a new 90-object benchmark.

Turning Video Models into Generalist Robot Policies

cs.RO · 2026-05-27 · unverdicted · novelty 6.0

Decouples action-free video world models from embodiment-specific IDMs using Jacobian-based translation to achieve zero-shot cross-embodiment robot policies.

citing papers explorer

Showing 3 of 3 citing papers after filters.

  • Human Universal Grasping cs.RO · 2026-06-15 · unverdicted · none · ref 41

    HUG trains a flow-matching model on a new 1M-frame egocentric human grasp dataset to generate retargetable grasps from single RGB-D images, beating baselines by 23-34% on a new 90-object benchmark.

  • MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction cs.CV · 2026-06-17 · unverdicted · none · ref 21

    Introduces a new task of goal-conditioned 3D point motion forecasting along with a 1.16M-video dataset, a 111-category benchmark, and a model that outperforms baselines while transferring to robotics and video generation.

  • Turning Video Models into Generalist Robot Policies cs.RO · 2026-05-27 · unverdicted · none · ref 43

    Decouples action-free video world models from embodiment-specific IDMs using Jacobian-based translation to achieve zero-shot cross-embodiment robot policies.