Vit- pose: Simple vision transformer baselines for human pose estimation.Advances in neural information processing sys- tems, 35:38571–38584

Yufei Xu, Jing Zhang, Qiming Zhang, Dacheng Tao · 2022

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

cs.CV · 2025-12-11 · unverdicted · novelty 7.0

MoCapAnything reconstructs asset-specific BVH animations from monocular video by predicting 3D joint trajectories then applying constraint-aware inverse kinematics guided by a reference prompt encoder.

citing papers explorer

Showing 1 of 1 citing paper.

MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos cs.CV · 2025-12-11 · unverdicted · none · ref 43
MoCapAnything reconstructs asset-specific BVH animations from monocular video by predicting 3D joint trajectories then applying constraint-aware inverse kinematics guided by a reference prompt encoder.

Vit- pose: Simple vision transformer baselines for human pose estimation.Advances in neural information processing sys- tems, 35:38571–38584

fields

years

verdicts

representative citing papers

citing papers explorer