CVNov 30, 2018

Parsing R-CNN for Instance-Level Human Analysis

arXiv:1811.12596v1131 citations
Originality Incremental advance
AI Analysis

This work addresses instance-level human analysis for computer vision applications, representing an incremental improvement with strong specific gains.

The paper tackles instance-level human analysis, such as segmentation and pose estimation, by proposing Parsing R-CNN, an end-to-end pipeline that outperforms state-of-the-art methods on CIHP, MHP v2.0, and DensePose-COCO datasets, achieving first place in the COCO 2018 DensePose Estimation challenge.

Instance-level human analysis is common in real-life scenarios and has multiple manifestations, such as human part segmentation, dense pose estimation, human-object interactions, etc. Models need to distinguish different human instances in the image panel and learn rich features to represent the details of each instance. In this paper, we present an end-to-end pipeline for solving the instance-level human analysis, named Parsing R-CNN. It processes a set of human instances simultaneously through comprehensive considering the characteristics of region-based approach and the appearance of a human, thus allowing representing the details of instances. Parsing R-CNN is very flexible and efficient, which is applicable to many issues in human instance analysis. Our approach outperforms all state-of-the-art methods on CIHP (Crowd Instance-level Human Parsing), MHP v2.0 (Multi-Human Parsing) and DensePose-COCO datasets. Based on the proposed Parsing R-CNN, we reach the 1st place in the COCO 2018 Challenge DensePose Estimation task. Code and models are public available.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes