arXiv preprint arXiv:2501.08329
cs.CV
Citing papers
- EgoForce: Forearm-Guided Camera-Space 3D Hand Pose from a Monocular Egocentric Camera
  EgoForce recovers absolute camera-space 3D hand pose from monocular egocentric images using forearm guidance, a unified arm-hand transformer, and a closed-form ray-space solver that handles fisheye, perspective, and wide-FOV cameras.
- EggHand: A Multimodal Foundation Model for Egocentric Hand Pose Forecasting
  EggHand unifies vision-language-action (VLA) decoding with viewpoint-aware video-text encoding to forecast egocentric hand poses, achieving state-of-the-art accuracy on EgoExo4D while remaining robust to ego-motion and controllable via language prompts.
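The EgoForce entry above credits a closed-form ray-space solver with handling fisheye, perspective, and wide-FOV cameras. A minimal sketch of the underlying idea, assuming a standard pinhole model and an equidistant fisheye model (r = f * theta): once a pixel is back-projected to a unit viewing ray, downstream geometry no longer depends on the camera type. The function names and parameters here are illustrative and not taken from the paper.

```python
import numpy as np

def perspective_ray(u, v, fx, fy, cx, cy):
    """Unit viewing ray for a pinhole (perspective) camera."""
    d = np.array([(u - cx) / fx, (v - cy) / fy, 1.0])
    return d / np.linalg.norm(d)

def fisheye_equidistant_ray(u, v, f, cx, cy):
    """Unit viewing ray for an equidistant fisheye camera (r = f * theta)."""
    x, y = u - cx, v - cy
    r = np.hypot(x, y)
    if r < 1e-9:
        return np.array([0.0, 0.0, 1.0])  # principal point looks down the axis
    theta = r / f                          # angle from the optical axis
    s = np.sin(theta)
    return np.array([s * x / r, s * y / r, np.cos(theta)])
```

Both functions return directions on the unit sphere, so a pose solver written in this "ray space" treats all camera models identically; only the back-projection step changes per lens.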