CVAug 19, 2025

OmniTry: Virtual Try-On Anything without Masks

arXiv:2508.13632v16 citationsh-index: 3
Originality Incremental advance
AI Analysis

This work addresses a practical need for more versatile virtual try-on applications in e-commerce and fashion, though it is incremental by extending existing methods to new object types.

The paper tackles the problem of virtual try-on for any wearable objects beyond clothes, such as jewelry and accessories, without requiring masks, and achieves better performance on object localization and ID-preservation compared to existing methods, as evaluated on a benchmark of 12 classes.

Virtual Try-ON (VTON) is a practical and widely-applied task, for which most of existing works focus on clothes. This paper presents OmniTry, a unified framework that extends VTON beyond garment to encompass any wearable objects, e.g., jewelries and accessories, with mask-free setting for more practical application. When extending to various types of objects, data curation is challenging for obtaining paired images, i.e., the object image and the corresponding try-on result. To tackle this problem, we propose a two-staged pipeline: For the first stage, we leverage large-scale unpaired images, i.e., portraits with any wearable items, to train the model for mask-free localization. Specifically, we repurpose the inpainting model to automatically draw objects in suitable positions given an empty mask. For the second stage, the model is further fine-tuned with paired images to transfer the consistency of object appearance. We observed that the model after the first stage shows quick convergence even with few paired samples. OmniTry is evaluated on a comprehensive benchmark consisting of 12 common classes of wearable objects, with both in-shop and in-the-wild images. Experimental results suggest that OmniTry shows better performance on both object localization and ID-preservation compared with existing methods. The code, model weights, and evaluation benchmark of OmniTry will be made publicly available at https://omnitry.github.io/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes