CLLGApr 12, 2021

FUDGE: Controlled Text Generation With Future Discriminators

arXiv:2104.05218v2800 citations
Originality Incremental advance
AI Analysis

This addresses the problem of fine-grained control in text generation for NLP practitioners, though it is incremental as it builds on existing generation models.

The authors tackled controlled text generation by proposing FUDGE, a method that conditions pre-existing models on desired attributes using future discriminators, resulting in gains across poetry completion, topic control, and formality change tasks.

We propose Future Discriminators for Generation (FUDGE), a flexible and modular method for controlled text generation. Given a pre-existing model G for generating text from a distribution of interest, FUDGE enables conditioning on a desired attribute a (for example, formality) while requiring access only to G's output logits. FUDGE learns an attribute predictor operating on a partial sequence, and uses this predictor's outputs to adjust G's original probabilities. We show that FUDGE models terms corresponding to a Bayesian decomposition of the conditional distribution of G given attribute a. Moreover, FUDGE can easily compose predictors for multiple desired attributes. We evaluate FUDGE on three tasks -- couplet completion in poetry, topic control in language generation, and formality change in machine translation -- and observe gains in all three tasks.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes