CVApr 24, 2022

RMGN: A Regional Mask Guided Network for Parser-free Virtual Try-on

arXiv:2204.11258v111 citationsh-index: 11
Originality Incremental advance
AI Analysis

This addresses a key limitation in e-commerce virtual try-on by improving detail synthesis without relying on parser information, though it is incremental as it builds on existing parser-free methods.

The paper tackles the problem of detail synthesis in parser-free virtual try-on, where distractions from original clothing persist in complex postures and high-resolution images, by proposing RMGN, which uses a regional mask to fuse features and achieves state-of-the-art performance in experiments.

Virtual try-on(VTON) aims at fitting target clothes to reference person images, which is widely adopted in e-commerce.Existing VTON approaches can be narrowly categorized into Parser-Based(PB) and Parser-Free(PF) by whether relying on the parser information to mask the persons' clothes and synthesize try-on images. Although abandoning parser information has improved the applicability of PF methods, the ability of detail synthesizing has also been sacrificed. As a result, the distraction from original cloth may persistin synthesized images, especially in complicated postures and high resolution applications. To address the aforementioned issue, we propose a novel PF method named Regional Mask Guided Network(RMGN). More specifically, a regional mask is proposed to explicitly fuse the features of target clothes and reference persons so that the persisted distraction can be eliminated. A posture awareness loss and a multi-level feature extractor are further proposed to handle the complicated postures and synthesize high resolution images. Extensive experiments demonstrate that our proposed RMGN outperforms both state-of-the-art PB and PF methods.Ablation studies further verify the effectiveness ofmodules in RMGN.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes