LGOct 4, 2023

FroSSL: Frobenius Norm Minimization for Efficient Multiview Self-Supervised Learning

arXiv:2310.02903v42 citationsh-index: 8
Originality Incremental advance
AI Analysis

This work addresses efficiency issues in SSL for computer vision, offering a faster training method that is incremental in combining existing approaches without eigendecomposition.

The paper tackles the problem of epoch inefficiency in multiview self-supervised learning by introducing FroSSL, an objective function that minimizes covariance Frobenius norms to avoid collapse and mean-squared error for augmentation invariance, achieving competitive accuracies more quickly than other methods.

Self-supervised learning (SSL) is a popular paradigm for representation learning. Recent multiview methods can be classified as sample-contrastive, dimension-contrastive, or asymmetric network-based, with each family having its own approach to avoiding informational collapse. While these families converge to solutions of similar quality, it can be empirically shown that some methods are epoch-inefficient and require longer training to reach a target performance. Two main approaches to improving efficiency are covariance eigenvalue regularization and using more views. However, these two approaches are difficult to combine due to the computational complexity of computing eigenvalues. We present the objective function FroSSL which reconciles both approaches while avoiding eigendecomposition entirely. FroSSL works by minimizing covariance Frobenius norms to avoid collapse and minimizing mean-squared error to achieve augmentation invariance. We show that FroSSL reaches competitive accuracies more quickly than any other SSL method and provide theoretical and empirical support that this faster convergence is due to how FroSSL affects the eigenvalues of the embedding covariance matrices. We also show that FroSSL learns competitive representations on linear probe evaluation when used to train a ResNet-18 on several datasets, including STL-10, Tiny ImageNet, and ImageNet-100.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes