CLAILGFeb 23, 2022

COLD Decoding: Energy-based Constrained Text Generation with Langevin Dynamics

arXiv:2202.11705v3200 citations
Originality Highly original
AI Analysis

It addresses the need for controlled text generation in applications like lexically-constrained generation, abductive reasoning, and counterfactual reasoning, offering a flexible solution that works with existing language models.

The paper tackles the problem of incorporating hard and soft constraints in text generation, presenting COLD decoding, a framework that uses energy functions and gradient-based sampling to achieve this without task-specific fine-tuning, demonstrating effectiveness in three applications through automatic and human evaluations.

Many applications of text generation require incorporating different constraints to control the semantics or style of generated text. These constraints can be hard (e.g., ensuring certain keywords are included in the output) and soft (e.g., contextualizing the output with the left- or right-hand context). In this paper, we present Energy-based Constrained Decoding with Langevin Dynamics (COLD), a decoding framework which unifies constrained generation as specifying constraints through an energy function, then performing efficient differentiable reasoning over the constraints through gradient-based sampling. COLD decoding is a flexible framework that can be applied directly to off-the-shelf left-to-right language models without the need for any task-specific fine-tuning, as demonstrated through three challenging text generation applications: lexically-constrained generation, abductive reasoning, and counterfactual reasoning. Our experiments on these constrained generation tasks point to the effectiveness of our approach, both in terms of automatic and human evaluation.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes