CVJun 4, 2019

Face Parsing with RoI Tanh-Warping

arXiv:1906.01342v185 citations
Originality Incremental advance
AI Analysis

This work improves face parsing accuracy for applications in computer vision, though it is incremental as it builds on existing RoI-based methods.

The paper tackles the problem of face parsing by proposing a RoI Tanh-warping operator to combine central and peripheral vision, addressing limitations in handling unpredictable areas like hair, and demonstrates state-of-the-art performance on HELEN and LFW-PL benchmarks.

Face parsing computes pixel-wise label maps for different semantic components (e.g., hair, mouth, eyes) from face images. Existing face parsing literature have illustrated significant advantages by focusing on individual regions of interest (RoIs) for faces and facial components. However, the traditional crop-and-resize focusing mechanism ignores all contextual area outside the RoIs, and thus is not suitable when the component area is unpredictable, e.g. hair. Inspired by the physiological vision system of human, we propose a novel RoI Tanh-warping operator that combines the central vision and the peripheral vision together. It addresses the dilemma between a limited sized RoI for focusing and an unpredictable area of surrounding context for peripheral information. To this end, we propose a novel hybrid convolutional neural network for face parsing. It uses hierarchical local based method for inner facial components and global methods for outer facial components. The whole framework is simple and principled, and can be trained end-to-end. To facilitate future research of face parsing, we also manually relabel the training data of the HELEN dataset and will make it public. Experiments on both HELEN and LFW-PL benchmarks demonstrate that our method surpasses state-of-the-art methods.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes