CVMar 25, 2024

DPStyler: Dynamic PromptStyler for Source-Free Domain Generalization

arXiv:2403.16697v217 citationsh-index: 5IEEE transactions on multimedia
Originality Incremental advance
AI Analysis

This addresses domain generalization for computer vision applications where source data is unavailable, representing an incremental improvement over existing methods.

The paper tackles source-free domain generalization by introducing DPStyler, which uses dynamic style generation and removal modules to simulate diverse domain styles without source data, achieving state-of-the-art performance on benchmark datasets.

Source-Free Domain Generalization (SFDG) aims to develop a model that works for unseen target domains without relying on any source domain. Research in SFDG primarily bulids upon the existing knowledge of large-scale vision-language models and utilizes the pre-trained model's joint vision-language space to simulate style transfer across domains, thus eliminating the dependency on source domain images. However, how to efficiently simulate rich and diverse styles using text prompts, and how to extract domain-invariant information useful for classification from features that contain both semantic and style information after the encoder, are directions that merit improvement. In this paper, we introduce Dynamic PromptStyler (DPStyler), comprising Style Generation and Style Removal modules to address these issues. The Style Generation module refreshes all styles at every training epoch, while the Style Removal module eliminates variations in the encoder's output features caused by input styles. Moreover, since the Style Generation module, responsible for generating style word vectors using random sampling or style mixing, makes the model sensitive to input text prompts, we introduce a model ensemble method to mitigate this sensitivity. Extensive experiments demonstrate that our framework outperforms state-of-the-art methods on benchmark datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes