CV LGAug 14, 2021

Unsupervised Disentanglement without Autoencoding: Pitfalls and Future Directions

Andrea Burns, Aaron Sarna, Dilip Krishnan, Aaron Maschinot

arXiv:2108.06613v13.75 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This work addresses the scalability issues of generative models for disentanglement in large datasets, but it is incremental as it highlights challenges without presenting a breakthrough solution.

The paper tackled the problem of achieving unsupervised disentangled visual representations without generative models, finding that regularization methods with contrastive learning face difficulties due to optimization sensitivity and trade-offs in task performance.

Disentangled visual representations have largely been studied with generative models such as Variational AutoEncoders (VAEs). While prior work has focused on generative methods for disentangled representation learning, these approaches do not scale to large datasets due to current limitations of generative models. Instead, we explore regularization methods with contrastive learning, which could result in disentangled representations that are powerful enough for large scale datasets and downstream applications. However, we find that unsupervised disentanglement is difficult to achieve due to optimization and initialization sensitivity, with trade-offs in task performance. We evaluate disentanglement with downstream tasks, analyze the benefits and disadvantages of each regularization used, and discuss future directions.

View on arXiv PDF Code

Similar