LGMLNov 25, 2019

Towards Better Understanding of Disentangled Representations via Mutual Information

arXiv:1911.10922v36 citations
Originality Incremental advance
AI Analysis

This work addresses a foundational issue in representation learning for machine learning researchers, but it is incremental as it builds on existing independence assumptions.

The authors tackled the problem of characterizing disentangled representations by proposing that mutual information between each factor and the data should be invariant conditioned on other factors, and they showed experimentally that violating this assumption leads to a decline in disentanglement.

Most existing works on disentangled representation learning are solely built upon an marginal independence assumption: all factors in disentangled representations should be statistically independent. This assumption is necessary but definitely not sufficient for the disentangled representations without additional inductive biases in the modeling process, which is shown theoretically in recent studies. We argue in this work that disentangled representations should be characterized by their relation with observable data. In particular, we formulate such a relation through the concept of mutual information: the mutual information between each factor of the disentangled representations and data should be invariant conditioned on values of the other factors. Together with the widely accepted independence assumption, we further bridge it with the conditional independence of factors in representations conditioned on data. Moreover, we note that conditional independence of latent variables has been imposed on most VAE-type models and InfoGAN due to the artificial choice of factorized approximate posterior $q(\rvz|\rvx)$ in the encoders. Such an arrangement of encoders introduces a crucial inductive bias for disentangled representations. To demonstrate the importance of our proposed assumption and the related inductive bias, we show in experiments that violating the assumption leads to decline of disentanglement among factors in the learned representations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes