CLAIJun 12, 2024

Prompt-Based Length Controlled Generation with Multiple Control Types

arXiv:2406.10278v136 citations
Originality Incremental advance
AI Analysis

This addresses a practical need for users who require precise length control in text generation, though it is incremental as it builds on existing length control methods.

The paper tackles the problem of generating text within specific length ranges using large language models, proposing a prompt-based method that improves accuracy on summarization datasets like CNNDM and NYT under multiple control types such as 'equal to' and others.

Large language models (LLMs) have attracted great attention given their strong performance on a wide range of NLP tasks. In practice, users often expect generated texts to fall within a specific length range, making length controlled generation an important topic, especially for GPT-style models. Existing length control methods mostly focus on a simple control type of "equal to" a target length. Different from them, we propose a prompt-based method to achieve length controlled generation under different control types with high accuracy. In particular, we adopt reinforcement learning (RL) and sample filtering with the reward signal given by rule-based reward models, which enhances the length control ability of models by rewarding outputs that follow certain control instructions. In addition, we introduce a standard prompt extractor to parse arbitrary users' input into standard control instructions. Experiments show that our method significantly improves the accuracy of prompt-based length control on popular summarization datasets like CNNDM and NYT under multiple control types. Moreover, both the standard prompt extractor and RL-tuned model show strong generalization to unseen control prompt templates.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes