LGAIFeb 5, 2023

Learning Interpretable Low-dimensional Representation via Physical Symmetry

arXiv:2302.10890v45 citationsh-index: 24
Originality Highly original
AI Analysis

This addresses the challenge of discovering general computational principles for interpretable representations in fields like music and computer vision, offering a novel approach that reduces reliance on domain knowledge.

The paper tackled the problem of learning interpretable low-dimensional representations from time-series data by using physical symmetry as a self-consistency constraint, resulting in the model learning a linear pitch factor from unlabelled monophonic music audio and a 3D Cartesian space from videos of a moving object without labels.

We have recently seen great progress in learning interpretable music representations, ranging from basic factors, such as pitch and timbre, to high-level concepts, such as chord and texture. However, most methods rely heavily on music domain knowledge. It remains an open question what general computational principles give rise to interpretable representations, especially low-dim factors that agree with human perception. In this study, we take inspiration from modern physics and use physical symmetry as a self consistency constraint for the latent space of time-series data. Specifically, it requires the prior model that characterises the dynamics of the latent states to be equivariant with respect to certain group transformations. We show that physical symmetry leads the model to learn a linear pitch factor from unlabelled monophonic music audio in a self-supervised fashion. In addition, the same methodology can be applied to computer vision, learning a 3D Cartesian space from videos of a simple moving object without labels. Furthermore, physical symmetry naturally leads to counterfactual representation augmentation, a new technique which improves sample efficiency.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes