CV · Nov 3, 2023

Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval

arXiv:2311.02122v1 · 5 citations · h-index: 4
Originality: Highly original
AI Analysis

This work opens a new facet of fashion recommendation by retrieving complete outfits from text, addressing a gap for consumers who want personalized style without having to pre-select items.

The paper tackles the problem of assembling complete outfit sets from textual descriptions alone. It introduces a text-to-outfit retrieval task and a model that outperforms state-of-the-art models on the Maryland Polyvore and Polyvore Outfit datasets.

Fashion stylists have historically bridged the gap between consumers' desires and perfect outfits, which involve intricate combinations of colors, patterns, and materials. Although recent advancements in fashion recommendation systems have made strides in outfit compatibility prediction and complementary item retrieval, these systems rely heavily on pre-selected customer choices. Therefore, we introduce a groundbreaking approach to fashion recommendations: a text-to-outfit retrieval task that generates a complete outfit set based solely on textual descriptions given by users. Our model is devised at three semantic levels (item, style, and outfit), where each level progressively aggregates data to form a coherent outfit recommendation based on textual input. Here, we leverage strategies similar to those in the contrastive language-image pretraining (CLIP) model to address the intricate-style matrix within the outfit sets. Using the Maryland Polyvore and Polyvore Outfit datasets, our approach significantly outperformed state-of-the-art models in text-video retrieval tasks, solidifying its effectiveness in the fashion recommendation domain. This research not only pioneers a new facet of fashion recommendation systems, but also introduces a method that captures the essence of individual style preferences through textual descriptions.
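The abstract does not give implementation details, so the following is only a minimal sketch of the general idea it names: item embeddings are aggregated into an outfit embedding, and a CLIP-style symmetric contrastive loss pulls matching text and outfit vectors together. Mean pooling, the temperature of 0.07, the toy 2-dimensional vectors, and all function names are assumptions made for illustration; they are not taken from the paper.

```python
import math

def l2_normalize(v):
    """Scale a vector to unit length (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)

def mean_pool(item_embeddings):
    """Aggregate item-level vectors into one outfit-level vector.
    Mean pooling is an assumed stand-in for the paper's aggregation."""
    n = len(item_embeddings)
    dim = len(item_embeddings[0])
    return [sum(item[d] for item in item_embeddings) / n for d in range(dim)]

def similarity_matrix(text_vecs, outfit_vecs, temperature=0.07):
    """Cosine similarity between every text and every outfit, divided by temperature."""
    texts = [l2_normalize(t) for t in text_vecs]
    outfits = [l2_normalize(o) for o in outfit_vecs]
    return [[sum(a * b for a, b in zip(t, o)) / temperature for o in outfits]
            for t in texts]

def cross_entropy_diagonal(logits):
    """Mean cross-entropy where the correct column for row i is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        m = max(row)  # subtract the max for numerical stability
        log_sum_exp = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_sum_exp - row[i]
    return total / len(logits)

def symmetric_contrastive_loss(text_vecs, outfit_vecs, temperature=0.07):
    """Average of the text-to-outfit and outfit-to-text losses, as in CLIP-style training."""
    logits = similarity_matrix(text_vecs, outfit_vecs, temperature)
    transposed = [list(col) for col in zip(*logits)]
    return 0.5 * (cross_entropy_diagonal(logits) + cross_entropy_diagonal(transposed))

def retrieve(text_vec, outfit_vecs):
    """Return the index of the outfit most similar to the text query."""
    scores = similarity_matrix([text_vec], outfit_vecs, temperature=1.0)[0]
    return max(range(len(scores)), key=scores.__getitem__)

# Toy data: two outfits, each a list of two hypothetical item embeddings.
outfits = [
    [[1.0, 0.0], [0.8, 0.2]],
    [[0.0, 1.0], [0.1, 0.9]],
]
outfit_vecs = [mean_pool(items) for items in outfits]
text_vecs = [[1.0, 0.0], [0.0, 1.0]]  # text i describes outfit i

matched_loss = symmetric_contrastive_loss(text_vecs, outfit_vecs)
mismatched_loss = symmetric_contrastive_loss(text_vecs[::-1], outfit_vecs)
best_for_first_text = retrieve(text_vecs[0], outfit_vecs)
```

In this toy setup the loss is lower when each text is paired with its own outfit than when the pairs are swapped, which is the property training relies on. At query time, retrieval reduces to ranking outfit vectors by similarity to the text vector.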

