CVNov 10, 2025

AvatarTex: High-Fidelity Facial Texture Reconstruction from Single-Image Stylized Avatars

arXiv:2511.06721v11 citationsh-index: 6
Originality Highly original
AI Analysis

This addresses the challenge of generating consistent and diverse facial textures for stylized avatars in computer graphics and virtual reality, representing a strong specific gain rather than a foundational advancement.

The paper tackles the problem of reconstructing high-fidelity facial textures from single-image stylized avatars, which existing methods struggle with due to dataset limitations and geometric inconsistencies, and achieves state-of-the-art results by introducing a novel three-stage diffusion-to-GAN pipeline and a new dataset of 20,000 multi-style UV textures.

We present AvatarTex, a high-fidelity facial texture reconstruction framework capable of generating both stylized and photorealistic textures from a single image. Existing methods struggle with stylized avatars due to the lack of diverse multi-style datasets and challenges in maintaining geometric consistency in non-standard textures. To address these limitations, AvatarTex introduces a novel three-stage diffusion-to-GAN pipeline. Our key insight is that while diffusion models excel at generating diversified textures, they lack explicit UV constraints, whereas GANs provide a well-structured latent space that ensures style and topology consistency. By integrating these strengths, AvatarTex achieves high-quality topology-aligned texture synthesis with both artistic and geometric coherence. Specifically, our three-stage pipeline first completes missing texture regions via diffusion-based inpainting, refines style and structure consistency using GAN-based latent optimization, and enhances fine details through diffusion-based repainting. To address the need for a stylized texture dataset, we introduce TexHub, a high-resolution collection of 20,000 multi-style UV textures with precise UV-aligned layouts. By leveraging TexHub and our structured diffusion-to-GAN pipeline, AvatarTex establishes a new state-of-the-art in multi-style facial texture reconstruction. TexHub will be released upon publication to facilitate future research in this field.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes