LGMLMar 18, 2019

M$^2$VAE - Derivation of a Multi-Modal Variational Autoencoder Objective from the Marginal Joint Log-Likelihood

arXiv:1903.07303v12 citations
Originality Synthesis-oriented
AI Analysis

This work provides a theoretical foundation for multi-modal VAEs, which is incremental as it builds on existing VAE theory.

The paper tackles the problem of deriving a trainable objective for multi-modal variational autoencoders by presenting an in-depth derivation of the evidence lower bound from the marginal joint log-likelihood, resulting in a framework for training such models.

This work gives an in-depth derivation of the trainable evidence lower bound obtained from the marginal joint log-Likelihood with the goal of training a Multi-Modal Variational Autoencoder (M$^2$VAE).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes