CVAIDec 3, 2024

AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction

arXiv:2412.02684v134 citationsh-index: 21CVPR
Originality Incremental advance
AI Analysis

This work solves the challenge of creating detailed and consistent animatable avatars from single images for digital human modeling applications, representing an incremental advancement over existing methods.

The paper tackles the problem of generating animatable 3D human avatars from a single image by addressing issues like fine detail capture and viewpoint inconsistencies, achieving photorealistic, real-time animation with improved generalization.

Generating animatable human avatars from a single image is essential for various digital human modeling applications. Existing 3D reconstruction methods often struggle to capture fine details in animatable models, while generative approaches for controllable animation, though avoiding explicit 3D modeling, suffer from viewpoint inconsistencies in extreme poses and computational inefficiencies. In this paper, we address these challenges by leveraging the power of generative models to produce detailed multi-view canonical pose images, which help resolve ambiguities in animatable human reconstruction. We then propose a robust method for 3D reconstruction of inconsistent images, enabling real-time rendering during inference. Specifically, we adapt a transformer-based video generation model to generate multi-view canonical pose images and normal maps, pretraining on a large-scale video dataset to improve generalization. To handle view inconsistencies, we recast the reconstruction problem as a 4D task and introduce an efficient 3D modeling approach using 4D Gaussian Splatting. Experiments demonstrate that our method achieves photorealistic, real-time animation of 3D human avatars from in-the-wild images, showcasing its effectiveness and generalization capability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes