TriBYOL: Triplet BYOL for Self-Supervised Representation Learning
It provides a feasible solution for self-supervised learning with real-world high-resolution images using small batch sizes, addressing a computational bottleneck for researchers and practitioners.
The paper tackles the problem of self-supervised learning requiring large batch sizes and heavy computational resources by proposing a triplet network with a triple-view loss, resulting in drastically outperforming state-of-the-art methods on several datasets in small-batch cases.
This paper proposes a novel self-supervised learning method for learning better representations with small batch sizes. Many self-supervised learning methods based on certain forms of the siamese network have emerged and received significant attention. However, these methods need to use large batch sizes to learn good representations and require heavy computational resources. We present a new triplet network combined with a triple-view loss to improve the performance of self-supervised representation learning with small batch sizes. Experimental results show that our method can drastically outperform state-of-the-art self-supervised learning methods on several datasets in small-batch cases. Our method provides a feasible solution for self-supervised learning with real-world high-resolution images that uses small batch sizes.