CVMay 16, 2019

Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision

arXiv:1905.06817v1340 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of robust 3D face reconstruction from in-the-wild images for computer vision applications, representing an incremental advance by leveraging existing models like FLAME.

The paper tackles the problem of estimating 3D face shape from a single image without 3D supervision by introducing RingNet, which uses a novel loss based on identity consistency across multiple images and achieves more accurate results than methods with 3D supervision.

The estimation of 3D face shape from a single image must be robust to variations in lighting, head pose, expression, facial hair, makeup, and occlusions. Robustness requires a large training set of in-the-wild images, which by construction, lack ground truth 3D shape. To train a network without any 2D-to-3D supervision, we present RingNet, which learns to compute 3D face shape from a single image. Our key observation is that an individual's face shape is constant across images, regardless of expression, pose, lighting, etc. RingNet leverages multiple images of a person and automatically detected 2D face features. It uses a novel loss that encourages the face shape to be similar when the identity is the same and different for different people. We achieve invariance to expression by representing the face using the FLAME model. Once trained, our method takes a single image and outputs the parameters of FLAME, which can be readily animated. Additionally we create a new database of faces `not quite in-the-wild' (NoW) with 3D head scans and high-resolution images of the subjects in a wide variety of conditions. We evaluate publicly available methods and find that RingNet is more accurate than methods that use 3D supervision. The dataset, model, and results are available for research purposes at http://ringnet.is.tuebingen.mpg.de.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes