CV, GR | Aug 20, 2024

DEGAS: Detailed Expressions on Full-Body Gaussian Avatars

arXiv:2408.10588v2 | 13 citations | h-index: 6
Originality: Incremental advance
AI Analysis

This work addresses a gap in creating expressive full-body avatars for interactive AI agents. The advance is incremental, since it builds on existing 3DGS rendering and on expression latent spaces learned from 2D portrait images.

The authors tackle the problem of incorporating detailed facial expressions into full-body avatars. They present DEGAS, a 3D Gaussian Splatting-based method that achieves photorealistic rendering with subtle and accurate expressions, as demonstrated on an existing dataset and a newly proposed one.

Although neural rendering has made significant advances in creating lifelike, animatable full-body and head avatars, incorporating detailed expressions into full-body avatars remains largely unexplored. We present DEGAS, the first 3D Gaussian Splatting (3DGS)-based modeling method for full-body avatars with rich facial expressions. Trained on multiview videos of a given subject, our method learns a conditional variational autoencoder that takes both the body motion and facial expression as driving signals to generate Gaussian maps in the UV layout. To drive the facial expressions, instead of the commonly used 3D Morphable Models (3DMMs) in 3D head avatars, we propose to adopt the expression latent space trained solely on 2D portrait images, bridging the gap between 2D talking faces and 3D avatars. Leveraging the rendering capability of 3DGS and the rich expressiveness of the expression latent space, the learned avatars can be reenacted to reproduce photorealistic rendering images with subtle and accurate facial expressions. Experiments on an existing dataset and our newly proposed dataset of full-body talking avatars demonstrate the efficacy of our method. We also propose an audio-driven extension of our method with the help of 2D talking faces, opening new possibilities for interactive AI agents.
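The data flow described in the abstract (driving signals in, a Gaussian map in the UV layout out) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper or its code: the 14-channel per-texel layout, the linear `decode_gaussian_map` stand-in for the decoder, the vector sizes, and the rectangular `uv_mask` are all hypothetical. Only the reparameterization step is the standard VAE formulation.

```python
import numpy as np

# Hypothetical per-Gaussian channel layout (illustrative, not the paper's):
# 3D position, rotation quaternion, scale, opacity, RGB color.
CHANNELS = {"position": 3, "rotation": 4, "scale": 3, "opacity": 1, "color": 3}
NUM_CHANNELS = sum(CHANNELS.values())  # 14


def reparameterize(mu, log_var, rng):
    """Standard VAE reparameterization: z = mu + sigma * eps."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps


def decode_gaussian_map(z, body_pose, expression, weights, uv_size):
    """Toy linear decoder (a stand-in for a real network).

    Concatenates the latent code with both driving signals and maps the
    result to a (uv_size, uv_size, NUM_CHANNELS) Gaussian map.
    """
    cond = np.concatenate([z, body_pose, expression])
    return (weights @ cond).reshape(uv_size, uv_size, NUM_CHANNELS)


def gaussians_from_map(gaussian_map, uv_mask):
    """Keep one Gaussian per valid UV texel and split its channels by name."""
    params = gaussian_map[uv_mask]  # shape: (num_valid_texels, NUM_CHANNELS)
    out, start = {}, 0
    for name, width in CHANNELS.items():
        out[name] = params[:, start:start + width]
        start += width
    # Normalize quaternions so each one is a valid rotation.
    norms = np.linalg.norm(out["rotation"], axis=1, keepdims=True)
    out["rotation"] = out["rotation"] / norms
    # Squash raw opacity values into (0, 1) with a sigmoid.
    out["opacity"] = 1.0 / (1.0 + np.exp(-out["opacity"]))
    return out


# Usage with made-up sizes and random inputs.
rng = np.random.default_rng(0)
uv_size, z_dim, pose_dim, expr_dim = 8, 4, 6, 5
weights = 0.1 * rng.standard_normal(
    (uv_size * uv_size * NUM_CHANNELS, z_dim + pose_dim + expr_dim)
)
z = reparameterize(np.zeros(z_dim), np.zeros(z_dim), rng)
gaussian_map = decode_gaussian_map(
    z, rng.standard_normal(pose_dim), rng.standard_normal(expr_dim), weights, uv_size
)
uv_mask = np.zeros((uv_size, uv_size), dtype=bool)
uv_mask[2:6, 2:6] = True  # pretend only these texels lie on the body surface
gaussians = gaussians_from_map(gaussian_map, uv_mask)
```

The point of the UV layout in this sketch is that a 2D decoder output can be read directly as a set of 3D Gaussians, one per valid texel, which a 3DGS renderer would then rasterize. The renderer itself is omitted here.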

Code Implementations: 1 repo
