Tailor: An Integrated Text-Driven CG-Ready Human and Garment Generation System
This addresses the need for accessible, integrated pipelines for creating ready-to-use clothed avatars, which is incremental as it builds on recent generative AI advances but offers a more comprehensive solution.
The paper tackles the problem of generating detailed 3D human avatars with garments from text descriptions by introducing Tailor, an integrated system that produces high-fidelity, customizable, and simulation-ready clothed avatars, outperforming existing state-of-the-art methods in fidelity, usability, and diversity.
Creating detailed 3D human avatars with garments typically requires specialized expertise and labor-intensive processes. Although recent advances in generative AI have enabled text-to-3D human/clothing generation, current methods fall short in offering accessible, integrated pipelines for producing ready-to-use clothed avatars. To solve this, we introduce Tailor, an integrated text-to-avatar system that generates high-fidelity, customizable 3D humans with simulation-ready garments. Our system includes a three-stage pipeline. We first employ a large language model to interpret textual descriptions into parameterized body shapes and semantically matched garment templates. Next, we develop topology-preserving deformation with novel geometric losses to adapt garments precisely to body geometries. Furthermore, an enhanced texture diffusion module with a symmetric local attention mechanism ensures both view consistency and photorealistic details. Quantitative and qualitative evaluations demonstrate that Tailor outperforms existing SoTA methods in terms of fidelity, usability, and diversity. Code will be available for academic use.