Improved ArtGAN for Conditional Synthesis of Natural Image and Artwork
This work addresses image generation for applications in art and natural imagery, but it is incremental as it builds on existing GAN methods with specific improvements.
The paper tackles conditional image synthesis by proposing ArtGAN, which introduces label gradient feedback and an autoencoder-enhanced discriminator, achieving state-of-the-art Inception scores on CIFAR-10 and generating plausible images on datasets like Oxford-102 and CUB-200.
This paper proposes a series of new approaches to improve Generative Adversarial Network (GAN) for conditional image synthesis and we name the proposed model as ArtGAN. One of the key innovation of ArtGAN is that, the gradient of the loss function w.r.t. the label (randomly assigned to each generated image) is back-propagated from the categorical discriminator to the generator. With the feedback from the label information, the generator is able to learn more efficiently and generate image with better quality. Inspired by recent works, an autoencoder is incorporated into the categorical discriminator for additional complementary information. Last but not least, we introduce a novel strategy to improve the image quality. In the experiments, we evaluate ArtGAN on CIFAR-10 and STL-10 via ablation studies. The empirical results showed that our proposed model outperforms the state-of-the-art results on CIFAR-10 in terms of Inception score. Qualitatively, we demonstrate that ArtGAN is able to generate plausible-looking images on Oxford-102 and CUB-200, as well as able to draw realistic artworks based on style, artist, and genre. The source code and models are available at: https://github.com/cs-chan/ArtGAN