LGDec 2, 2022

Improved Representation Learning Through Tensorized Autoencoders

arXiv:2212.01046v12 citationsh-index: 13
Originality Incremental advance
AI Analysis

This work addresses the limitation of standard autoencoders in capturing cluster-specific structures for unsupervised representation learning, offering an incremental improvement for applications like clustering and de-noising.

The authors tackled the problem of learning cluster-specific representations in data with inherent cluster structures by proposing a tensorized autoencoder (TAE) meta-algorithm, which extends arbitrary AE architectures to simultaneously learn cluster assignments and embeddings, achieving improved performance in clustering and de-noising tasks as validated empirically.

The central question in representation learning is what constitutes a good or meaningful representation. In this work we argue that if we consider data with inherent cluster structures, where clusters can be characterized through different means and covariances, those data structures should be represented in the embedding as well. While Autoencoders (AE) are widely used in practice for unsupervised representation learning, they do not fulfil the above condition on the embedding as they obtain a single representation of the data. To overcome this we propose a meta-algorithm that can be used to extend an arbitrary AE architecture to a tensorized version (TAE) that allows for learning cluster-specific embeddings while simultaneously learning the cluster assignment. For the linear setting we prove that TAE can recover the principle components of the different clusters in contrast to principle component of the entire data recovered by a standard AE. We validated this on planted models and for general, non-linear and convolutional AEs we empirically illustrate that tensorizing the AE is beneficial in clustering and de-noising tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes