ML AI LGApr 10, 2018

Understanding disentangling in $β$-VAE

Christopher P. Burgess, Irina Higgins, Arka Pal, Loic Matthey, Nick Watters, Guillaume Desjardins, Alexander Lerchner

arXiv:1804.03599v145.9941 citations

Originality Incremental advance

AI Analysis

This work addresses the challenge of achieving disentangled representations in variational autoencoders, which is important for interpretable machine learning, though it is incremental as it builds on existing β-VAE methods.

The paper tackles the problem of understanding and improving disentangled representation learning in β-VAEs by providing new theoretical insights and proposing a training modification that progressively increases latent code capacity, resulting in robust disentanglement without sacrificing reconstruction accuracy.

We present new intuitions and theoretical assessments of the emergence of disentangled representation in variational autoencoders. Taking a rate-distortion theory perspective, we show the circumstances under which representations aligned with the underlying generative factors of variation of data emerge when optimising the modified ELBO bound in $β$-VAE, as training progresses. From these insights, we propose a modification to the training regime of $β$-VAE, that progressively increases the information capacity of the latent code during training. This modification facilitates the robust learning of disentangled representations in $β$-VAE, without the previous trade-off in reconstruction accuracy.

View on arXiv PDF

Similar