LGCVSep 26, 2021

Cluster Analysis with Deep Embeddings and Contrastive Learning

arXiv:2109.12714v25 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of unsupervised disentangled representation learning for computer vision researchers, offering an incremental improvement over existing clustering and contrastive learning methods.

The paper tackles unsupervised image clustering by proposing a framework that combines instance-level contrastive learning with deep embedding-based cluster center prediction, achieving a 7-8% improvement in Normalized Mutual Information (NMI) on CIFAR-10 with a score of 0.772 compared to state-of-the-art methods.

Unsupervised disentangled representation learning is a long-standing problem in computer vision. This work proposes a novel framework for performing image clustering from deep embeddings by combining instance-level contrastive learning with a deep embedding based cluster center predictor. Our approach jointly learns representations and predicts cluster centers in an end-to-end manner. This is accomplished via a three-pronged approach that combines a clustering loss, an instance-wise contrastive loss, and an anchor loss. Our fundamental intuition is that using an ensemble loss that incorporates instance-level features and a clustering procedure focusing on semantic similarity reinforces learning better representations in the latent space. We observe that our method performs exceptionally well on popular vision datasets when evaluated using standard clustering metrics such as Normalized Mutual Information (NMI), in addition to producing geometrically well-separated cluster embeddings as defined by the Euclidean distance. Our framework performs on par with widely accepted clustering methods and outperforms the state-of-the-art contrastive learning method on the CIFAR-10 dataset with an NMI score of 0.772, a 7-8% improvement on the strong baseline.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes