Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation

Authors

  • Jia Li University of Science and Technology of China
  • Wen Su Zhejiang Sci-Tech University
  • Zengfu Wang Institute of Intelligent Machines, Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v34i07.6797

Abstract

We rethink a well-known bottom-up approach for multi-person pose estimation and propose an improved one. The improved approach surpasses the baseline significantly thanks to (1) an intuitional yet more sensible representation, which we refer to as body parts to encode the connection information between keypoints, (2) an improved stacked hourglass network with attention mechanisms, (3) a novel focal L2 loss which is dedicated to “hard” keypoint and keypoint association (body part) mining, and (4) a robust greedy keypoint assignment algorithm for grouping the detected keypoints into individual poses. Our approach not only works straightforwardly but also outperforms the baseline by about 15% in average precision and is comparable to the state of the art on the MS-COCO test-dev dataset. The code and pre-trained models are publicly available on our project page1.

Downloads

Published

2020-04-03

How to Cite

Li, J., Su, W., & Wang, Z. (2020). Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation. Proceedings of the AAAI Conference on Artificial Intelligence, 34(07), 11354-11361. https://doi.org/10.1609/aaai.v34i07.6797

Issue

Section

AAAI Technical Track: Vision