CVAug 17, 2021

MV-TON: Memory-based Video Virtual Try-on network

Xiaojing Zhong, Zhonghua Wu, Taizhe Tan, Guosheng Lin, Qingyao Wu

arXiv:2108.07502v141 citations

Originality Highly original

AI Analysis

This addresses the problem of realistic video try-on for e-commerce and virtual fitting applications, representing a novel method for a known bottleneck.

The paper tackles video-based virtual try-on by proposing MV-TON, which transfers clothes to a target person without clothing templates and generates high-resolution realistic videos, showing effectiveness and superiority over existing methods.

With the development of Generative Adversarial Network, image-based virtual try-on methods have made great progress. However, limited work has explored the task of video-based virtual try-on while it is important in real-world applications. Most existing video-based virtual try-on methods usually require clothing templates and they can only generate blurred and low-resolution results. To address these challenges, we propose a Memory-based Video virtual Try-On Network (MV-TON), which seamlessly transfers desired clothes to a target person without using any clothing templates and generates high-resolution realistic videos. Specifically, MV-TON consists of two modules: 1) a try-on module that transfers the desired clothes from model images to frame images by pose alignment and region-wise replacing of pixels; 2) a memory refinement module that learns to embed the existing generated frames into the latent space as external memory for the following frame generation. Experimental results show the effectiveness of our method in the video virtual try-on task and its superiority over other existing methods.

View on arXiv PDF

Similar