CVLGOct 18, 2022

Towards Efficient and Effective Self-Supervised Learning of Visual Representations

arXiv:2210.09866v19 citationsh-index: 57
Originality Incremental advance
AI Analysis

This work addresses efficiency issues in self-supervised learning for computer vision, offering an incremental improvement to existing methods.

The paper tackles the slow convergence of self-supervised visual representation learning methods by proposing to strengthen them with auxiliary tasks like rotation prediction, resulting in significant performance gains, especially at lower training epochs, as demonstrated on multiple datasets.

Self-supervision has emerged as a propitious method for visual representation learning after the recent paradigm shift from handcrafted pretext tasks to instance-similarity based approaches. Most state-of-the-art methods enforce similarity between various augmentations of a given image, while some methods additionally use contrastive approaches to explicitly ensure diverse representations. While these approaches have indeed shown promising direction, they require a significantly larger number of training iterations when compared to the supervised counterparts. In this work, we explore reasons for the slow convergence of these methods, and further propose to strengthen them using well-posed auxiliary tasks that converge significantly faster, and are also useful for representation learning. The proposed method utilizes the task of rotation prediction to improve the efficiency of existing state-of-the-art methods. We demonstrate significant gains in performance using the proposed method on multiple datasets, specifically for lower training epochs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes