CVSep 10, 2023

MaskRenderer: 3D-Infused Multi-Mask Realistic Face Reenactment

arXiv:2309.05095v16 citationsh-index: 7
Originality Incremental advance
AI Analysis

This work addresses face reenactment for generating realistic video frames, but it is incremental as it builds on existing methods with specific improvements.

The paper tackles challenges in face reenactment, such as identity leakage and mouth movement imitation, by introducing MaskRenderer, which uses 3D modeling and multi-scale techniques to generate realistic frames, outperforming state-of-the-art models on unseen faces in the VoxCeleb1 test set.

We present a novel end-to-end identity-agnostic face reenactment system, MaskRenderer, that can generate realistic, high fidelity frames in real-time. Although recent face reenactment works have shown promising results, there are still significant challenges such as identity leakage and imitating mouth movements, especially for large pose changes and occluded faces. MaskRenderer tackles these problems by using (i) a 3DMM to model 3D face structure to better handle pose changes, occlusion, and mouth movements compared to 2D representations; (ii) a triplet loss function to embed the cross-reenactment during training for better identity preservation; and (iii) multi-scale occlusion, improving inpainting and restoring missing areas. Comprehensive quantitative and qualitative experiments conducted on the VoxCeleb1 test set, demonstrate that MaskRenderer outperforms state-of-the-art models on unseen faces, especially when the Source and Driving identities are very different.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes