AI CL LGFeb 26, 2017

Maximum-Likelihood Augmented Discrete Generative Adversarial Networks

Tong Che, Yanran Li, Ruixiang Zhang, R Devon Hjelm, Wenjie Li, Yangqiu Song, Yoshua Bengio

arXiv:1702.07983v133.5249 citations

Originality Highly original

AI Analysis

This addresses a fundamental problem in machine learning for researchers and practitioners working with discrete data like text, offering a more stable training method for GANs in such domains.

The paper tackles the challenge of applying generative adversarial networks (GANs) to discrete settings, such as natural language tasks, by proposing a novel low-variance objective derived from the discriminator's output that corresponds to log-likelihood, resulting in improved stability and effectiveness across various discrete datasets.

Despite the successes in capturing continuous distributions, the application of generative adversarial networks (GANs) to discrete settings, like natural language tasks, is rather restricted. The fundamental reason is the difficulty of back-propagation through discrete random variables combined with the inherent instability of the GAN training objective. To address these problems, we propose Maximum-Likelihood Augmented Discrete Generative Adversarial Networks. Instead of directly optimizing the GAN objective, we derive a novel and low-variance objective using the discriminator's output that follows corresponds to the log-likelihood. Compared with the original, the new objective is proved to be consistent in theory and beneficial in practice. The experimental results on various discrete datasets demonstrate the effectiveness of the proposed approach.

View on arXiv PDF

Similar