CVJan 20

GO-MLVTON: Garment Occlusion-Aware Multi-Layer Virtual Try-On with Diffusion Models

arXiv:2601.13524v11 citationsh-index: 7
Originality Highly original
AI Analysis

This addresses the challenge of realistic multi-layer garment try-on for virtual fashion applications, representing a novel approach in the field.

The paper tackles the problem of multi-layer virtual try-on (ML-VTON) by proposing GO-MLVTON, which uses a Garment Occlusion Learning module and StableDiffusion-based Garment Morphing & Fitting to generate realistic multi-layer garment images, achieving state-of-the-art performance as demonstrated in experiments.

Existing Image-based virtual try-on (VTON) methods primarily focus on single-layer or multi-garment VTON, neglecting multi-layer VTON (ML-VTON), which involves dressing multiple layers of garments onto the human body with realistic deformation and layering to generate visually plausible outcomes. The main challenge lies in accurately modeling occlusion relationships between inner and outer garments to reduce interference from redundant inner garment features. To address this, we propose GO-MLVTON, the first multi-layer VTON method, introducing the Garment Occlusion Learning module to learn occlusion relationships and the StableDiffusion-based Garment Morphing & Fitting module to deform and fit garments onto the human body, producing high-quality multi-layer try-on results. Additionally, we present the MLG dataset for this task and propose a new metric named Layered Appearance Coherence Difference (LACD) for evaluation. Extensive experiments demonstrate the state-of-the-art performance of GO-MLVTON. Project page: https://upyuyang.github.io/go-mlvton/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes