CV LG IVOct 13, 2020

Are all negatives created equal in contrastive instance discrimination?

Tiffany Tianhui Cai, Jonathan Frankle, David J. Schwab, Ari S. Morcos

arXiv:2010.06682v221.697 citations

Originality Incremental advance

AI Analysis

This work addresses the problem of inefficient negative sampling in self-supervised learning for computer vision, offering insights that could lead to more effective training methods.

The study investigated the importance of negative sample difficulty in contrastive instance discrimination, finding that only the hardest 5% of negatives were necessary and sufficient for achieving nearly full accuracy in downstream tasks, while the easiest 95% were unnecessary and insufficient.

Self-supervised learning has recently begun to rival supervised learning on computer vision tasks. Many of the recent approaches have been based on contrastive instance discrimination (CID), in which the network is trained to recognize two augmented versions of the same instance (a query and positive) while discriminating against a pool of other instances (negatives). The learned representation is then used on downstream tasks such as image classification. Using methodology from MoCo v2 (Chen et al., 2020), we divided negatives by their difficulty for a given query and studied which difficulty ranges were most important for learning useful representations. We found a minority of negatives -- the hardest 5% -- were both necessary and sufficient for the downstream task to reach nearly full accuracy. Conversely, the easiest 95% of negatives were unnecessary and insufficient. Moreover, the very hardest 0.1% of negatives were unnecessary and sometimes detrimental. Finally, we studied the properties of negatives that affect their hardness, and found that hard negatives were more semantically similar to the query, and that some negatives were more consistently easy or hard than we would expect by chance. Together, our results indicate that negatives vary in importance and that CID may benefit from more intelligent negative treatment.

View on arXiv PDF

Similar