MLAILGApr 10, 2018

Understanding disentangling in $β$-VAE

arXiv:1804.03599v1941 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of achieving disentangled representations in variational autoencoders, which is important for interpretable machine learning, though it is incremental as it builds on existing β-VAE methods.

The paper tackles the problem of understanding and improving disentangled representation learning in β-VAEs by providing new theoretical insights and proposing a training modification that progressively increases latent code capacity, resulting in robust disentanglement without sacrificing reconstruction accuracy.

We present new intuitions and theoretical assessments of the emergence of disentangled representation in variational autoencoders. Taking a rate-distortion theory perspective, we show the circumstances under which representations aligned with the underlying generative factors of variation of data emerge when optimising the modified ELBO bound in $β$-VAE, as training progresses. From these insights, we propose a modification to the training regime of $β$-VAE, that progressively increases the information capacity of the latent code during training. This modification facilitates the robust learning of disentangled representations in $β$-VAE, without the previous trade-off in reconstruction accuracy.

Code Implementations23 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes