M$^2$VAE - Derivation of a Multi-Modal Variational Autoencoder Objective from the Marginal Joint Log-Likelihood
arXiv:1903.07303v12 citations
Originality Synthesis-oriented
AI Analysis
This work provides a theoretical foundation for multi-modal VAEs, which is incremental as it builds on existing VAE theory.
The paper tackles the problem of deriving a trainable objective for multi-modal variational autoencoders by presenting an in-depth derivation of the evidence lower bound from the marginal joint log-likelihood, resulting in a framework for training such models.
This work gives an in-depth derivation of the trainable evidence lower bound obtained from the marginal joint log-Likelihood with the goal of training a Multi-Modal Variational Autoencoder (M$^2$VAE).