CVAug 31, 2024

COSMo: CLIP Talks on Open-Set Multi-Target Domain Adaptation

arXiv:2409.00397v25 citationsh-index: 23Has Code
Originality Incremental advance
AI Analysis

This addresses a more realistic scenario in domain adaptation for computer vision, though it appears incremental as it builds on existing CLIP and DA methods.

The paper tackles the problem of Open-Set Multi-Target Domain Adaptation (OSMTDA), where existing methods struggle with domain and class shifts, by introducing COSMo, a method that learns domain-agnostic prompts using CLIP, resulting in an average improvement of 5.1% across three datasets.

Multi-Target Domain Adaptation (MTDA) entails learning domain-invariant information from a single source domain and applying it to multiple unlabeled target domains. Yet, existing MTDA methods predominantly focus on addressing domain shifts within visual features, often overlooking semantic features and struggling to handle unknown classes, resulting in what is known as Open-Set (OS) MTDA. While large-scale vision-language foundation models like CLIP show promise, their potential for MTDA remains largely unexplored. This paper introduces COSMo, a novel method that learns domain-agnostic prompts through source domain-guided prompt learning to tackle the MTDA problem in the prompt space. By leveraging a domain-specific bias network and separate prompts for known and unknown classes, COSMo effectively adapts across domain and class shifts. To the best of our knowledge, COSMo is the first method to address Open-Set Multi-Target DA (OSMTDA), offering a more realistic representation of real-world scenarios and addressing the challenges of both open-set and multi-target DA. COSMo demonstrates an average improvement of $5.1\%$ across three challenging datasets: Mini-DomainNet, Office-31, and Office-Home, compared to other related DA methods adapted to operate within the OSMTDA setting. Code is available at: https://github.com/munish30monga/COSMo

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes