CVFeb 17, 2025

HumanGif: Single-View Human Diffusion with Generative Prior

arXiv:2502.12080v37 citationsh-index: 15
Originality Incremental advance
AI Analysis

This addresses the problem of limited information in single-view inputs for 3D human creation, offering a solution for applications like animation and virtual avatars, though it builds incrementally on existing diffusion and NeRF methods.

The paper tackles the challenge of generating realistic, view-consistent, and temporally coherent 3D human avatars from a single image by proposing HumanGif, a single-view human diffusion model with generative prior, which achieves the best perceptual performance on multiple datasets.

Previous 3D human creation methods have made significant progress in synthesizing view-consistent and temporally aligned results from sparse-view images or monocular videos. However, it remains challenging to produce perpetually realistic, view-consistent, and temporally coherent human avatars from a single image, as limited information is available in the single-view input setting. Motivated by the success of 2D character animation, we propose HumanGif, a single-view human diffusion model with generative prior. Specifically, we formulate the single-view-based 3D human novel view and pose synthesis as a single-view-conditioned human diffusion process, utilizing generative priors from foundational diffusion models to complement the missing information. To ensure fine-grained and consistent novel view and pose synthesis, we introduce a Human NeRF module in HumanGif to learn spatially aligned features from the input image, implicitly capturing the relative camera and human pose transformation. Furthermore, we introduce an image-level loss during optimization to bridge the gap between latent and image spaces in diffusion models. Extensive experiments on RenderPeople, DNA-Rendering, THuman 2.1, and TikTok datasets demonstrate that HumanGif achieves the best perceptual performance, with better generalizability for novel view and pose synthesis.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes