CVMar 25, 2024

Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text

arXiv:2403.16897v18 citationsh-index: 10CVPR
Originality Incremental advance
AI Analysis

This addresses the problem of creating vivid and charming textures for animatable cartoon characters in applications like animation and gaming, representing a novel domain-specific advancement.

The paper tackles automatic texture design for 3D biped cartoon characters from text instructions, achieving high-quality texture generation in UV space with improved performance over current methods, as shown in extensive experiments.

Creating and animating 3D biped cartoon characters is crucial and valuable in various applications. Compared with geometry, the diverse texture design plays an important role in making 3D biped cartoon characters vivid and charming. Therefore, we focus on automatic texture design for cartoon characters based on input instructions. This is challenging for domain-specific requirements and a lack of high-quality data. To address this challenge, we propose Make-It-Vivid, the first attempt to enable high-quality texture generation from text in UV space. We prepare a detailed text-texture paired data for 3D characters by using vision-question-answering agents. Then we customize a pretrained text-to-image model to generate texture map with template structure while preserving the natural 2D image knowledge. Furthermore, to enhance fine-grained details, we propose a novel adversarial learning scheme to shorten the domain gap between original dataset and realistic texture domain. Extensive experiments show that our approach outperforms current texture generation methods, resulting in efficient character texturing and faithful generation with prompts. Besides, we showcase various applications such as out of domain generation and texture stylization. We also provide an efficient generation system for automatic text-guided textured character generation and animation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes