CVGRDec 1, 2021

FaceTuneGAN: Face Autoencoder for Convolutional Expression Transfer Using Neural Generative Adversarial Networks

arXiv:2112.00532v125 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of realistic 3D facial animation for applications like gaming and virtual reality, though it is incremental as it adapts existing 2D methods to 3D.

The paper tackles the problem of 3D facial expression transfer by proposing FaceTuneGAN, a model that decomposes and encodes facial identity and expression separately, resulting in better identity decomposition and face neutralization than state-of-the-art techniques, with less artifacts in blendshape predictions.

In this paper, we present FaceTuneGAN, a new 3D face model representation decomposing and encoding separately facial identity and facial expression. We propose a first adaptation of image-to-image translation networks, that have successfully been used in the 2D domain, to 3D face geometry. Leveraging recently released large face scan databases, a neural network has been trained to decouple factors of variations with a better knowledge of the face, enabling facial expressions transfer and neutralization of expressive faces. Specifically, we design an adversarial architecture adapting the base architecture of FUNIT and using SpiralNet++ for our convolutional and sampling operations. Using two publicly available datasets (FaceScape and CoMA), FaceTuneGAN has a better identity decomposition and face neutralization than state-of-the-art techniques. It also outperforms classical deformation transfer approach by predicting blendshapes closer to ground-truth data and with less of undesired artifacts due to too different facial morphologies between source and target.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes