CVDec 30, 2020

Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation

arXiv:2012.15175v4189 citationsHas Code
AI Analysis

This work provides a significant improvement for bottom-up human pose estimation, making it more robust to scale variations and labeling ambiguities, and achieving performance comparable to top-down methods.

The paper addresses the limitations of fixed-standard-deviation Gaussian kernels in heatmap regression for bottom-up human pose estimation, especially concerning varying human scales and labeling ambiguities. They propose Scale-Adaptive Heatmap Regression (SAHR) to adjust standard deviations per keypoint, and Weight-Adaptive Heatmap Regression (WAHR) to balance foreground-background samples. This combined approach improves accuracy by +1.5AP over the state-of-the-art model, achieving 72.0AP on COCO test-dev2017.

Heatmap regression has become the most prevalent choice for nowadays human pose estimation methods. The ground-truth heatmaps are usually constructed via covering all skeletal keypoints by 2D gaussian kernels. The standard deviations of these kernels are fixed. However, for bottom-up methods, which need to handle a large variance of human scales and labeling ambiguities, the current practice seems unreasonable. To better cope with these problems, we propose the scale-adaptive heatmap regression (SAHR) method, which can adaptively adjust the standard deviation for each keypoint. In this way, SAHR is more tolerant of various human scales and labeling ambiguities. However, SAHR may aggravate the imbalance between fore-background samples, which potentially hurts the improvement of SAHR. Thus, we further introduce the weight-adaptive heatmap regression (WAHR) to help balance the fore-background samples. Extensive experiments show that SAHR together with WAHR largely improves the accuracy of bottom-up human pose estimation. As a result, we finally outperform the state-of-the-art model by +1.5AP and achieve 72.0AP on COCO test-dev2017, which is com-arable with the performances of most top-down methods. Source codes are available at https://github.com/greatlog/SWAHR-HumanPose.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes