CVAILGSep 28, 2023

Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face Synthesis

arXiv:2309.16859v137 citationsh-index: 42
Originality Incremental advance
AI Analysis

This enables more accessible and efficient face synthesis for applications like virtual reality or entertainment, though it builds incrementally on existing NeRF methods.

The paper tackles the problem of synthesizing ultra high-resolution novel views of human faces with few input images, achieving results from as few as two casually captured views.

NeRFs have enabled highly realistic synthesis of human faces including complex appearance and reflectance effects of hair and skin. These methods typically require a large number of multi-view input images, making the process hardware intensive and cumbersome, limiting applicability to unconstrained settings. We propose a novel volumetric human face prior that enables the synthesis of ultra high-resolution novel views of subjects that are not part of the prior's training distribution. This prior model consists of an identity-conditioned NeRF, trained on a dataset of low-resolution multi-view images of diverse humans with known camera calibration. A simple sparse landmark-based 3D alignment of the training dataset allows our model to learn a smooth latent space of geometry and appearance despite a limited number of training identities. A high-quality volumetric representation of a novel subject can be obtained by model fitting to 2 or 3 camera views of arbitrary resolution. Importantly, our method requires as few as two views of casually captured images as input at inference time.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes