CLMay 19, 2023

BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases

arXiv:2305.12018v1233 citations
Originality Incremental advance
AI Analysis

It addresses efficiency and fluency issues in controlled text generation for real-world applications, representing an incremental improvement over existing methods.

The paper tackles the slow sampling problem in energy-based models for controlled text generation by proposing BOLT, which uses tunable biases to adjust language model logits, achieving 7x faster speed and higher fluency in 74.4% of samples compared to baselines.

Energy-based models (EBMs) have gained popularity for controlled text generation due to their high applicability to a wide range of constraints. However, sampling from EBMs is non-trivial, as it often requires a large number of iterations to converge to plausible text, which slows down the decoding process and makes it less practical for real-world applications. In this work, we propose BOLT, which relies on tunable biases to directly adjust the language model's output logits. Unlike prior work, BOLT maintains the generator's autoregressive nature to assert a strong control on token-wise conditional dependencies and overall fluency, and thus converges faster. When compared with state-of-the-arts on controlled generation tasks using both soft constraints (e.g., sentiment control) and hard constraints (e.g., keyword-guided topic control), BOLT demonstrates significantly improved efficiency and fluency. On sentiment control, BOLT is 7x faster than competitive baselines, and more fluent in 74.4% of the evaluation samples according to human judges.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes