CVLGOct 9, 2023

EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders

arXiv:2310.05718v321 citationsh-index: 17Has Code
Originality Incremental advance
AI Analysis

This addresses a common issue in training deep generative models with discrete representations, offering an incremental improvement for researchers in generative modeling.

The paper tackles the codebook collapse problem in discrete variational autoencoders (dVAEs) by replacing softmax with evidential deep learning, resulting in improved reconstruction performance and enhanced codebook usage compared to existing models.

Codebook collapse is a common problem in training deep generative models with discrete representation spaces like Vector Quantized Variational Autoencoders (VQ-VAEs). We observe that the same problem arises for the alternatively designed discrete variational autoencoders (dVAEs) whose encoder directly learns a distribution over the codebook embeddings to represent the data. We hypothesize that using the softmax function to obtain a probability distribution causes the codebook collapse by assigning overconfident probabilities to the best matching codebook elements. In this paper, we propose a novel way to incorporate evidential deep learning (EDL) instead of softmax to combat the codebook collapse problem of dVAE. We evidentially monitor the significance of attaining the probability distribution over the codebook embeddings, in contrast to softmax usage. Our experiments using various datasets show that our model, called EdVAE, mitigates codebook collapse while improving the reconstruction performance, and enhances the codebook usage compared to dVAE and VQ-VAE based models. Our code can be found at https://github.com/ituvisionlab/EdVAE .

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes