CV AIMar 27, 2022

CaCo: Both Positive and Negative Samples are Directly Learnable via Cooperative-adversarial Contrastive Learning

Xiao Wang, Yuhang Huang, Dan Zeng, Guo-Jun Qi

arXiv:2203.14370v19.428 citationsh-index: 23Has Code

Originality Highly original

AI Analysis

This addresses the challenge of sample selection in self-supervised learning for computer vision, offering a principled approach that improves representation learning.

The paper tackles the problem of heuristically choosing positive and negative samples in contrastive learning by proposing a method where both are directly learnable end-to-end with the encoder, achieving 71.3% and 75.3% top-1 accuracy on ImageNet1K with ResNet-50 over 200 and 800 epochs, respectively.

As a representative self-supervised method, contrastive learning has achieved great successes in unsupervised training of representations. It trains an encoder by distinguishing positive samples from negative ones given query anchors. These positive and negative samples play critical roles in defining the objective to learn the discriminative encoder, avoiding it from learning trivial features. While existing methods heuristically choose these samples, we present a principled method where both positive and negative samples are directly learnable end-to-end with the encoder. We show that the positive and negative samples can be cooperatively and adversarially learned by minimizing and maximizing the contrastive loss, respectively. This yields cooperative positives and adversarial negatives with respect to the encoder, which are updated to continuously track the learned representation of the query anchors over mini-batches. The proposed method achieves 71.3% and 75.3% in top-1 accuracy respectively over 200 and 800 epochs of pre-training ResNet-50 backbone on ImageNet1K without tricks such as multi-crop or stronger augmentations. With Multi-Crop, it can be further boosted into 75.7%. The source code and pre-trained model are released in https://github.com/maple-research-lab/caco.

View on arXiv PDF Code

Similar