CVSep 29, 2023

Information Flow in Self-Supervised Learning

Zhiquan Tan, Jingqin Yang, Weiran Huang, Yang Yuan, Yifan Zhang

arXiv:2309.17281v314.119 citationsh-index: 28Has Code

Originality Incremental advance

AI Analysis

It addresses the problem of understanding and improving self-supervised learning for computer vision researchers, with incremental contributions building on existing methods.

The paper analyzes self-supervised learning methods using matrix mutual information, revealing that loss functions optimize mutual information and joint entropy, and introduces M-MAE, which improves linear probing ViT-Base by 3.9% and fine-tuning ViT-Large by 1% on ImageNet.

In this paper, we conduct a comprehensive analysis of two dual-branch (Siamese architecture) self-supervised learning approaches, namely Barlow Twins and spectral contrastive learning, through the lens of matrix mutual information. We prove that the loss functions of these methods implicitly optimize both matrix mutual information and matrix joint entropy. This insight prompts us to further explore the category of single-branch algorithms, specifically MAE and U-MAE, for which mutual information and joint entropy become the entropy. Building on this intuition, we introduce the Matrix Variational Masked Auto-Encoder (M-MAE), a novel method that leverages the matrix-based estimation of entropy as a regularizer and subsumes U-MAE as a special case. The empirical evaluations underscore the effectiveness of M-MAE compared with the state-of-the-art methods, including a 3.9% improvement in linear probing ViT-Base, and a 1% improvement in fine-tuning ViT-Large, both on ImageNet.

View on arXiv PDF Code

Similar