LGFeb 13, 2025

On the Importance of Embedding Norms in Self-Supervised Learning

arXiv:2502.09252v24 citationsh-index: 6ICML
Originality Highly original
AI Analysis

This research tackles the problem of understanding the importance of embedding norms in self-supervised learning for machine learning practitioners and researchers.

The authors investigated the role of embedding norms in self-supervised learning, finding that they govern convergence rates and encode network confidence, with smaller norms corresponding to unexpected samples. Embedding norms can have large effects on convergence speed.

Self-supervised learning (SSL) allows training data representations without a supervised signal and has become an important paradigm in machine learning. Most SSL methods employ the cosine similarity between embedding vectors and hence effectively embed data on a hypersphere. While this seemingly implies that embedding norms cannot play any role in SSL, a few recent works have suggested that embedding norms have properties related to network convergence and confidence. In this paper, we resolve this apparent contradiction and systematically establish the embedding norm's role in SSL training. Using theoretical analysis, simulations, and experiments, we show that embedding norms (i) govern SSL convergence rates and (ii) encode network confidence, with smaller norms corresponding to unexpected samples. Additionally, we show that manipulating embedding norms can have large effects on convergence speed. Our findings demonstrate that SSL embedding norms are integral to understanding and optimizing network behavior.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes