pith. sign in

arxiv: 1901.00148 · v4 · pith:7Z3UUXN4new · submitted 2019-01-01 · 💻 cs.CV

Rethinking on Multi-Stage Networks for Human Pose Estimation

classification 💻 cs.CV
keywords multi-stagemethodsposesingle-stagecurrentdesignestimationhuman
0
0 comments X
read the original abstract

Existing pose estimation approaches fall into two categories: single-stage and multi-stage methods. While multi-stage methods are seemingly more suited for the task, their performance in current practice is not as good as single-stage methods. This work studies this issue. We argue that the current multi-stage methods' unsatisfactory performance comes from the insufficiency in various design choices. We propose several improvements, including the single-stage module design, cross stage feature aggregation, and coarse-to-fine supervision. The resulting method establishes the new state-of-the-art on both MS COCO and MPII Human Pose dataset, justifying the effectiveness of a multi-stage architecture. The source code is publicly available for further research.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. SPARK: Low Latency Single-Camera 3D Pose Estimation for Autonomous Racing using Keypoints

    cs.RO 2026-06 unverdicted novelty 4.0

    SPARK applies keypoint detection with YOLO models to monocular images for low-latency 3D pose estimation of racing opponents, claiming better accuracy and speed than prior camera methods on real racing data.