LGMLApr 16, 2020

Classify and Generate: Using Classification Latent Space Representations for Image Generations

arXiv:2004.07543v24 citations
AI Analysis

This addresses the challenge of combining discriminative and generative features for image tasks, offering a novel approach for controlled generation, though it appears incremental in the broader context of generative modeling.

The paper tackles the problem of using classification latent spaces for image reconstruction and generation, proposing ReGene which directly represents data in classification space and achieves higher classification accuracy than existing conditional generative models while being competitive in terms of FID.

Utilization of classification latent space information for downstream reconstruction and generation is an intriguing and a relatively unexplored area. In general, discriminative representations are rich in class-specific features but are too sparse for reconstruction, whereas, in autoencoders the representations are dense but have limited indistinguishable class-specific features, making them less suitable for classification. In this work, we propose a discriminative modeling framework that employs manipulated supervised latent representations to reconstruct and generate new samples belonging to a given class. Unlike generative modeling approaches such as GANs and VAEs that aim to model the data manifold distribution, Representation based Generations (ReGene) directly represent the given data manifold in the classification space. Such supervised representations, under certain constraints, allow for reconstructions and controlled generations using an appropriate decoder without enforcing any prior distribution. Theoretically, given a class, we show that these representations when smartly manipulated using convex combinations retain the same class label. Furthermore, they also lead to the novel generation of visually realistic images. Extensive experiments on datasets of varying resolutions demonstrate that ReGene has higher classification accuracy than existing conditional generative models while being competitive in terms of FID.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes