CVIRMMMay 24, 2024

Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval

arXiv:2405.15451v17 citationsh-index: 4ICASSP
Originality Incremental advance
AI Analysis

This work addresses a domain-specific problem in fashion retrieval, offering an incremental improvement over existing static fusion techniques.

The paper tackles the problem of language-based fashion image retrieval by proposing a Self-distilled Dynamic Fusion Network to dynamically fuse multi-granularity features, addressing limitations of static fusion methods. It demonstrates effectiveness through extensive experiments, though no concrete numbers are provided in the abstract.

In the domain of language-based fashion image retrieval, pinpointing the desired fashion item using both a reference image and its accompanying textual description is an intriguing challenge. Existing approaches lean heavily on static fusion techniques, intertwining image and text. Despite their commendable advancements, these approaches are still limited by a deficiency in flexibility. In response, we propose a Self-distilled Dynamic Fusion Network to compose the multi-granularity features dynamically by considering the consistency of routing path and modality-specific information simultaneously. Two new modules are included in our proposed method: (1) Dynamic Fusion Network with Modality Specific Routers. The dynamic network enables a flexible determination of the routing for each reference image and modification text, taking into account their distinct semantics and distributions. (2) Self Path Distillation Loss. A stable path decision for queries benefits the optimization of feature extraction as well as routing, and we approach this by progressively refine the path decision with previous path information. Extensive experiments demonstrate the effectiveness of our proposed model compared to existing methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes