CVJun 24, 2025

Self-Paced Collaborative and Adversarial Network for Unsupervised Domain Adaptation

arXiv:2506.19267v145 citationsh-index: 98IEEE Trans Pattern Anal Mach Intell
Originality Incremental advance
AI Analysis

This addresses the problem of adapting models from labeled source domains to unlabeled target domains in computer vision, with incremental improvements over existing methods.

The paper tackles unsupervised domain adaptation by proposing a Collaborative and Adversarial Network (CAN) and its self-paced variant (SPCAN), which combine domain-collaborative and domain-adversarial learning to improve feature representation, achieving state-of-the-art performance on benchmark datasets like Office-31 and VISDA-2017.

This paper proposes a new unsupervised domain adaptation approach called Collaborative and Adversarial Network (CAN), which uses the domain-collaborative and domain-adversarial learning strategy for training the neural network. The domain-collaborative learning aims to learn domain-specific feature representation to preserve the discriminability for the target domain, while the domain adversarial learning aims to learn domain-invariant feature representation to reduce the domain distribution mismatch between the source and target domains. We show that these two learning strategies can be uniformly formulated as domain classifier learning with positive or negative weights on the losses. We then design a collaborative and adversarial training scheme, which automatically learns domain-specific representations from lower blocks in CNNs through collaborative learning and domain-invariant representations from higher blocks through adversarial learning. Moreover, to further enhance the discriminability in the target domain, we propose Self-Paced CAN (SPCAN), which progressively selects pseudo-labeled target samples for re-training the classifiers. We employ a self-paced learning strategy to select pseudo-labeled target samples in an easy-to-hard fashion. Comprehensive experiments on different benchmark datasets, Office-31, ImageCLEF-DA, and VISDA-2017 for the object recognition task, and UCF101-10 and HMDB51-10 for the video action recognition task, show our newly proposed approaches achieve the state-of-the-art performance, which clearly demonstrates the effectiveness of our proposed approaches for unsupervised domain adaptation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes