CVApr 15, 2024

Magic Clothing: Controllable Garment-Driven Image Synthesis

arXiv:2404.09512v231 citationsh-index: 10Has CodeMM
AI Analysis

This work addresses the need for controllable image synthesis in fashion and character design, though it is incremental as it builds upon existing latent diffusion models with novel modules.

The authors tackled the problem of generating customized characters wearing target garments with diverse text prompts by proposing Magic Clothing, a latent diffusion model-based network that preserves garment details and maintains faithfulness to text prompts, achieving state-of-the-art results in garment-driven image synthesis.

We propose Magic Clothing, a latent diffusion model (LDM)-based network architecture for an unexplored garment-driven image synthesis task. Aiming at generating customized characters wearing the target garments with diverse text prompts, the image controllability is the most critical issue, i.e., to preserve the garment details and maintain faithfulness to the text prompts. To this end, we introduce a garment extractor to capture the detailed garment features, and employ self-attention fusion to incorporate them into the pretrained LDMs, ensuring that the garment details remain unchanged on the target character. Then, we leverage the joint classifier-free guidance to balance the control of garment features and text prompts over the generated results. Meanwhile, the proposed garment extractor is a plug-in module applicable to various finetuned LDMs, and it can be combined with other extensions like ControlNet and IP-Adapter to enhance the diversity and controllability of the generated characters. Furthermore, we design Matched-Points-LPIPS (MP-LPIPS), a robust metric for evaluating the consistency of the target image to the source garment. Extensive experiments demonstrate that our Magic Clothing achieves state-of-the-art results under various conditional controls for garment-driven image synthesis. Our source code is available at https://github.com/ShineChen1024/MagicClothing.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes