MLLGJul 15, 2022

Partial Disentanglement via Mechanism Sparsity

arXiv:2207.07732v132 citationsh-index: 47
Originality Incremental advance
AI Analysis

This work addresses the challenge of extracting interpretable latent representations in machine learning, particularly for causal inference, but it is incremental as it builds on existing theory with a broader applicability.

The paper tackles the problem of unsupervised disentanglement of latent factors in causal graphs by generalizing a prior theory that required a specific graphical criterion, introducing a new equivalence relation called consistency to predict which factors remain entangled based on any ground-truth graph, resulting in a framework for partial disentanglement.

Disentanglement via mechanism sparsity was introduced recently as a principled approach to extract latent factors without supervision when the causal graph relating them in time is sparse, and/or when actions are observed and affect them sparsely. However, this theory applies only to ground-truth graphs satisfying a specific criterion. In this work, we introduce a generalization of this theory which applies to any ground-truth graph and specifies qualitatively how disentangled the learned representation is expected to be, via a new equivalence relation over models we call consistency. This equivalence captures which factors are expected to remain entangled and which are not based on the specific form of the ground-truth graph. We call this weaker form of identifiability partial disentanglement. The graphical criterion that allows complete disentanglement, proposed in an earlier work, can be derived as a special case of our theory. Finally, we enforce graph sparsity with constrained optimization and illustrate our theory and algorithm in simulations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes