CVLGAug 14, 2021

Unsupervised Disentanglement without Autoencoding: Pitfalls and Future Directions

arXiv:2108.06613v15 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the scalability issues of generative models for disentanglement in large datasets, but it is incremental as it highlights challenges without presenting a breakthrough solution.

The paper tackled the problem of achieving unsupervised disentangled visual representations without generative models, finding that regularization methods with contrastive learning face difficulties due to optimization sensitivity and trade-offs in task performance.

Disentangled visual representations have largely been studied with generative models such as Variational AutoEncoders (VAEs). While prior work has focused on generative methods for disentangled representation learning, these approaches do not scale to large datasets due to current limitations of generative models. Instead, we explore regularization methods with contrastive learning, which could result in disentangled representations that are powerful enough for large scale datasets and downstream applications. However, we find that unsupervised disentanglement is difficult to achieve due to optimization and initialization sensitivity, with trade-offs in task performance. We evaluate disentanglement with downstream tasks, analyze the benefits and disadvantages of each regularization used, and discuss future directions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes