LGAIMLMay 3, 2019

Disentangling Factors of Variation Using Few Labels

arXiv:1905.01258v2128 citations
AI Analysis

This addresses the challenge of disentanglement learning for representation learning practitioners by providing a practical, data-efficient solution, though it is incremental as it builds on existing unsupervised methods with added supervision.

The paper tackles the problem of learning disentangled representations by showing that a small amount of supervision (0.01-0.5% of labeled data) enables reliable model selection and training, empirically validating that such limited and imprecise labels can achieve consistent disentanglement.

Learning disentangled representations is considered a cornerstone problem in representation learning. Recently, Locatello et al. (2019) demonstrated that unsupervised disentanglement learning without inductive biases is theoretically impossible and that existing inductive biases and unsupervised methods do not allow to consistently learn disentangled representations. However, in many practical settings, one might have access to a limited amount of supervision, for example through manual labeling of (some) factors of variation in a few training examples. In this paper, we investigate the impact of such supervision on state-of-the-art disentanglement methods and perform a large scale study, training over 52000 models under well-defined and reproducible experimental conditions. We observe that a small number of labeled examples (0.01--0.5\% of the data set), with potentially imprecise and incomplete labels, is sufficient to perform model selection on state-of-the-art unsupervised models. Further, we investigate the benefit of incorporating supervision into the training process. Overall, we empirically validate that with little and imprecise supervision it is possible to reliably learn disentangled representations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes