CVJul 30, 2022

Improving Fine-tuning of Self-supervised Models with Contrastive Initialization

arXiv:2208.00238v125 citationsh-index: 53
Originality Incremental advance
AI Analysis

This addresses a bottleneck in fine-tuning self-supervised models for downstream tasks, offering an incremental improvement.

The paper tackles the problem that self-supervised models may not capture meaningful semantic information due to contrastive loss treating same-class images as negative pairs, which hampers fine-tuning; the proposed Contrastive Initialization (COIN) method significantly outperforms existing methods on multiple downstream tasks without extra training cost.

Self-supervised learning (SSL) has achieved remarkable performance in pretraining the models that can be further used in downstream tasks via fine-tuning. However, these self-supervised models may not capture meaningful semantic information since the images belonging to the same class are always regarded as negative pairs in the contrastive loss. Consequently, the images of the same class are often located far away from each other in learned feature space, which would inevitably hamper the fine-tuning process. To address this issue, we seek to provide a better initialization for the self-supervised models by enhancing the semantic information. To this end, we propose a Contrastive Initialization (COIN) method that breaks the standard fine-tuning pipeline by introducing an extra initialization stage before fine-tuning. Extensive experiments show that, with the enriched semantics, our COIN significantly outperforms existing methods without introducing extra training cost and sets new state-of-the-arts on multiple downstream tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes