CVDec 4, 2016

DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild

arXiv:1612.01202v240 citationsHas Code
AI Analysis

This work addresses the problem of accurate facial landmark localization for computer vision researchers, though it is incremental as it builds on existing regression and segmentation ideas.

The paper tackles dense image-to-template correspondence estimation by training a fully convolutional network to map image pixels to a dense template grid using manually annotated facial landmarks, achieving state-of-the-art landmark localization results on the 300W benchmark.

In this paper we propose to learn a mapping from image pixels into a dense template grid through a fully convolutional network. We formulate this task as a regression problem and train our network by leveraging upon manually annotated facial landmarks "in-the-wild". We use such landmarks to establish a dense correspondence field between a three-dimensional object template and the input image, which then serves as the ground-truth for training our regression system. We show that we can combine ideas from semantic segmentation with regression networks, yielding a highly-accurate "quantized regression" architecture. Our system, called DenseReg, allows us to estimate dense image-to-template correspondences in a fully convolutional manner. As such our network can provide useful correspondence information as a stand-alone system, while when used as an initialization for Statistical Deformable Models we obtain landmark localization results that largely outperform the current state-of-the-art on the challenging 300W benchmark. We thoroughly evaluate our method on a host of facial analysis tasks and also provide qualitative results for dense human body correspondence. We make our code available at http://alpguler.com/DenseReg.html along with supplementary materials.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes