LGROMLMar 4, 2020

q-VAE for Disentangled Representation Learning and Latent Dynamical Systems

arXiv:2003.01852v324 citations
AI Analysis

This work addresses the challenge of making robots controllable with feasible computational resources by enhancing latent space usefulness, though it appears incremental as it builds upon existing methods like β-VAE.

The paper tackles the problem of learning disentangled representations and latent dynamics for robotic control by proposing a q-VAE based on Tsallis statistics, which improves disentanglement while maintaining reconstruction accuracy on MNIST and enables stable long-term state prediction in nonlinear systems.

A variational autoencoder (VAE) derived from Tsallis statistics called q-VAE is proposed. In the proposed method, a standard VAE is employed to statistically extract latent space hidden in sampled data, and this latent space helps make robots controllable in feasible computational time and cost. To improve the usefulness of the latent space, this paper focuses on disentangled representation learning, e.g., $β$-VAE, which is the baseline for it. Starting from a Tsallis statistics perspective, a new lower bound for the proposed q-VAE is derived to maximize the likelihood of the sampled data, which can be considered an adaptive $β$-VAE with deformed Kullback-Leibler divergence. To verify the benefits of the proposed q-VAE, a benchmark task to extract the latent space from the MNIST dataset was performed. The results demonstrate that the proposed q-VAE improved disentangled representation while maintaining the reconstruction accuracy of the data. In addition, it relaxes the independency condition between data, which is demonstrated by learning the latent dynamics of nonlinear dynamical systems. By combining disentangled representation, the proposed q-VAE achieves stable and accurate long-term state prediction from the initial state and the action sequence. The dataset for hexapod walking is available on IEEE Dataport, doi: https://dx.doi.org/10.21227/99af-jw71.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes