CV LGOct 18, 2022

Towards Efficient and Effective Self-Supervised Learning of Visual Representations

Sravanti Addepalli, Kaushal Bhogale, Priyam Dey, R. Venkatesh Babu

arXiv:2210.09866v16.59 citationsh-index: 57Has Code

Originality Incremental advance

AI Analysis

This work addresses efficiency issues in self-supervised learning for computer vision, offering an incremental improvement to existing methods.

The paper tackles the slow convergence of self-supervised visual representation learning methods by proposing to strengthen them with auxiliary tasks like rotation prediction, resulting in significant performance gains, especially at lower training epochs, as demonstrated on multiple datasets.

Self-supervision has emerged as a propitious method for visual representation learning after the recent paradigm shift from handcrafted pretext tasks to instance-similarity based approaches. Most state-of-the-art methods enforce similarity between various augmentations of a given image, while some methods additionally use contrastive approaches to explicitly ensure diverse representations. While these approaches have indeed shown promising direction, they require a significantly larger number of training iterations when compared to the supervised counterparts. In this work, we explore reasons for the slow convergence of these methods, and further propose to strengthen them using well-posed auxiliary tasks that converge significantly faster, and are also useful for representation learning. The proposed method utilizes the task of rotation prediction to improve the efficiency of existing state-of-the-art methods. We demonstrate significant gains in performance using the proposed method on multiple datasets, specifically for lower training epochs.

View on arXiv PDF Code

Similar