LGAICVNEROSep 29, 2022

Understanding Collapse in Non-Contrastive Siamese Representation Learning

BerkeleyCMU
arXiv:2209.15007v247 citationsh-index: 111
Originality Incremental advance
AI Analysis

This addresses a critical issue in self-supervised representation learning for computer vision, offering insights and improvements to prevent collapse, though it is incremental as it builds on existing non-contrastive methods.

The paper tackles the problem of dimensional collapse in non-contrastive self-supervised learning methods like SimSiam, finding that they are sensitive to dataset and model size, and proposes a metric to measure collapse and a hybrid continual-multi-epoch training approach that improves linear probe accuracy by up to 18 percentage points on ImageNet.

Contrastive methods have led a recent surge in the performance of self-supervised representation learning (SSL). Recent methods like BYOL or SimSiam purportedly distill these contrastive methods down to their essence, removing bells and whistles, including the negative examples, that do not contribute to downstream performance. These "non-contrastive" methods work surprisingly well without using negatives even though the global minimum lies at trivial collapse. We empirically analyze these non-contrastive methods and find that SimSiam is extraordinarily sensitive to dataset and model size. In particular, SimSiam representations undergo partial dimensional collapse if the model is too small relative to the dataset size. We propose a metric to measure the degree of this collapse and show that it can be used to forecast the downstream task performance without any fine-tuning or labels. We further analyze architectural design choices and their effect on the downstream performance. Finally, we demonstrate that shifting to a continual learning setting acts as a regularizer and prevents collapse, and a hybrid between continual and multi-epoch training can improve linear probe accuracy by as many as 18 percentage points using ResNet-18 on ImageNet. Our project page is at https://alexanderli.com/noncontrastive-ssl/.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes