MLLGMay 30, 2017

Objective-Reinforced Generative Adversarial Networks (ORGAN) for Sequence Generation Models

arXiv:1705.10843v3574 citations
Originality Incremental advance
AI Analysis

This addresses the need for controllable generation in domains like chemistry and music, but it is incremental as it builds on prior GAN and RL methods.

The authors tackled the problem of biasing unsupervised sequence generation towards desirable metrics by combining Generative Adversarial Networks (GANs) and reinforcement learning (RL), showing effective biasing in molecule and music generation tasks.

In unsupervised data generation tasks, besides the generation of a sample based on previous observations, one would often like to give hints to the model in order to bias the generation towards desirable metrics. We propose a method that combines Generative Adversarial Networks (GANs) and reinforcement learning (RL) in order to accomplish exactly that. While RL biases the data generation process towards arbitrary metrics, the GAN component of the reward function ensures that the model still remembers information learned from data. We build upon previous results that incorporated GANs and RL in order to generate sequence data and test this model in several settings for the generation of molecules encoded as text sequences (SMILES) and in the context of music generation, showing for each case that we can effectively bias the generation process towards desired metrics.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes