CLApr 25, 2022

Which Discriminator for Cooperative Text Generation?

arXiv:2204.11586v14 citationsh-index: 32
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of improving text generation quality for AI applications by integrating external classifiers, though it is incremental as it compares existing discriminator types rather than introducing a new method.

The paper investigates three types of transformer-based discriminators (bidirectional, left-to-right, and generative) for cooperative text generation, where a classifier guides language model decoding to produce texts with desired properties like naturalness or style, evaluating their accuracy, sample quality, and computational performance.

Language models generate texts by successively predicting probability distributions for next tokens given past ones. A growing field of interest tries to leverage external information in the decoding process so that the generated texts have desired properties, such as being more natural, non toxic, faithful, or having a specific writing style. A solution is to use a classifier at each generation step, resulting in a cooperative environment where the classifier guides the decoding of the language model distribution towards relevant texts for the task at hand. In this paper, we examine three families of (transformer-based) discriminators for this specific task of cooperative decoding: bidirectional, left-to-right and generative ones. We evaluate the pros and cons of these different types of discriminators for cooperative generation, exploring respective accuracy on classification tasks along with their impact on the resulting sample quality and computational performances. We also provide the code of a batched implementation of the powerful cooperative decoding strategy used for our experiments, the Monte Carlo Tree Search, working with each discriminator for Natural Language Generation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes