CL · Oct 14, 2023

Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model

arXiv:2310.09520v4 · 161 citations · h-index: 59
AI Analysis

This work addresses the need for efficient and effective controlled text generation in AI applications, though it is incremental in that it builds on existing reward-based methods.

The paper tackles the problem of controlling text generation in large language models to avoid problematic content or achieve desired attributes. It introduces Reward-Augmented Decoding (RAD), which matches state-of-the-art re-training methods while minimizing computational overhead.

While large language models have proven effective in a huge range of downstream applications, they often generate text that is problematic or lacks a desired attribute. In this paper, we introduce Reward-Augmented Decoding (RAD), a text generation procedure that uses a small unidirectional reward model to encourage a language model to generate text that has certain properties. Specifically, RAD uses the reward model to score generations as they are produced and rescales sampling probabilities to favor high-reward tokens. By using a unidirectional reward model, RAD can cache activations from prior generation steps to decrease computational overhead. Through experiments on generating non-toxic and sentiment-controlled text, we demonstrate that RAD performs best among methods that change only the generation procedure and matches the performance of state-of-the-art methods that involve re-training the language model. We further validate that RAD is effective on very large language models while incurring a minimal computational overhead.
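The rescaling step described in the abstract can be sketched roughly as follows. This is a toy illustration, not the authors' implementation: the function name `reward_augmented_probs`, the additive form `logit + beta * reward`, the restriction to the `top_k` candidates, and the stand-in `toy_reward` function are all assumptions made for the example. The activation caching enabled by the unidirectional reward model is omitted.

```python
import math
from typing import Callable, Dict, List, Sequence


def reward_augmented_probs(
    logits: Sequence[float],
    prefix: List[int],
    reward_fn: Callable[[List[int]], float],
    top_k: int = 3,
    beta: float = 5.0,
) -> Dict[int, float]:
    """Return sampling probabilities over the top-k tokens after reward rescaling.

    Hypothetical helper: `beta` (reward weight) and `top_k` are assumed knobs.
    """
    # Keep only the k tokens the language model considers most likely.
    candidates = sorted(range(len(logits)), key=lambda t: logits[t], reverse=True)[:top_k]

    # Score each candidate continuation with the reward model and shift its logit.
    adjusted = {t: logits[t] + beta * reward_fn(prefix + [t]) for t in candidates}

    # Numerically stable softmax over the adjusted logits.
    peak = max(adjusted.values())
    exps = {t: math.exp(v - peak) for t, v in adjusted.items()}
    total = sum(exps.values())
    return {t: e / total for t, e in exps.items()}


def toy_reward(tokens: List[int]) -> float:
    # Stand-in for a learned reward model: token id 2 is treated as undesired.
    return 0.0 if tokens[-1] == 2 else 1.0


logits = [2.0, 1.0, 3.0, 0.5]  # token 2 is the language model's favorite
probs = reward_augmented_probs(logits, prefix=[0], reward_fn=toy_reward)
```

In this toy setup token 2 has the highest raw logit, but after rescaling it receives less probability mass than tokens 0 and 1, which the stand-in reward function prefers. A larger `beta` would push the distribution further toward high-reward tokens.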

Code Implementations: 2 repos