LGCVMLJun 9, 2020

Super-resolution Variational Auto-Encoders

arXiv:2006.05218v215 citations
AI Analysis

This addresses image quality problems in VAE-based generative models, but it is an incremental improvement over existing methods.

The paper tackled the blurriness issue in variational autoencoders (VAEs) by adding a downscaled image variable and using it for super-resolution-like decoding, resulting in comparable log-likelihood but better FID scores for image synthesis.

The framework of variational autoencoders (VAEs) provides a principled method for jointly learning latent-variable models and corresponding inference models. However, the main drawback of this approach is the blurriness of the generated images. Some studies link this effect to the objective function, namely, the (negative) log-likelihood. Here, we propose to enhance VAEs by adding a random variable that is a downscaled version of the original image and still use the log-likelihood function as the learning objective. Further, by providing the downscaled image as an input to the decoder, it can be used in a manner similar to the super-resolution. We present empirically that the proposed approach performs comparably to VAEs in terms of the negative log-likelihood, but it obtains a better FID score in data synthesis.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes