LGAICVOct 27, 2021

GenURL: A General Framework for Unsupervised Representation Learning

arXiv:2110.14553v413 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of fragmented algorithm development in unsupervised representation learning for researchers and practitioners, offering a unified approach that is incremental but broad in scope.

The paper tackles the lack of a unified framework for unsupervised representation learning (URL) across different tasks by proposing GenURL, a general similarity-based framework that adapts to various URL tasks, achieving state-of-the-art performance in self-supervised visual learning, unsupervised knowledge distillation, graph embeddings, and dimension reduction.

Unsupervised representation learning (URL), which learns compact embeddings of high-dimensional data without supervision, has made remarkable progress recently. However, the development of URLs for different requirements is independent, which limits the generalization of the algorithms, especially prohibitive as the number of tasks grows. For example, dimension reduction methods, t-SNE, and UMAP optimize pair-wise data relationships by preserving the global geometric structure, while self-supervised learning, SimCLR, and BYOL focus on mining the local statistics of instances under specific augmentations. To address this dilemma, we summarize and propose a unified similarity-based URL framework, GenURL, which can smoothly adapt to various URL tasks. In this paper, we regard URL tasks as different implicit constraints on the data geometric structure that help to seek optimal low-dimensional representations that boil down to data structural modeling (DSM) and low-dimensional transformation (LDT). Specifically, DMS provides a structure-based submodule to describe the global structures, and LDT learns compact low-dimensional embeddings with given pretext tasks. Moreover, an objective function, General Kullback-Leibler divergence (GKL), is proposed to connect DMS and LDT naturally. Comprehensive experiments demonstrate that GenURL achieves consistent state-of-the-art performance in self-supervised visual learning, unsupervised knowledge distillation (KD), graph embeddings (GE), and dimension reduction.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes