CLAug 6, 2025

DTPA: Dynamic Token-level Prefix Augmentation for Controllable Text Generation

arXiv:2508.04047v12 citationsh-index: 11
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in controllable text generation for NLP applications, offering an incremental improvement over existing methods.

The paper tackles the problem of declining controllability in long-form text generation using prefix-based methods by proposing Dynamic Token-level Prefix Augmentation (DTPA), which dynamically amplifies attention to prefixes and achieves superior attribute control while maintaining competitive fluency, diversity, and topic relevance across multiple tasks.

Controllable Text Generation (CTG) is a vital subfield in Natural Language Processing (NLP), aiming to generate text that aligns with desired attributes. However, previous studies commonly focus on the quality of controllable text generation for short sequences, while the generation of long-form text remains largely underexplored. In this paper, we observe that the controllability of texts generated by the powerful prefix-based method Air-Decoding tends to decline with increasing sequence length, which we hypothesize primarily arises from the observed decay in attention to the prefixes. Meanwhile, different types of prefixes including soft and hard prefixes are also key factors influencing performance. Building on these insights, we propose a lightweight and effective framework called Dynamic Token-level Prefix Augmentation (DTPA) based on Air-Decoding for controllable text generation. Specifically, it first selects the optimal prefix type for a given task. Then we dynamically amplify the attention to the prefix for the attribute distribution to enhance controllability, with a scaling factor growing exponentially as the sequence length increases. Moreover, based on the task, we optionally apply a similar augmentation to the original prompt for the raw distribution to balance text quality. After attribute distribution reconstruction, the generated text satisfies the attribute constraints well. Experiments on multiple CTG tasks demonstrate that DTPA generally outperforms other methods in attribute control while maintaining competitive fluency, diversity, and topic relevance. Further analysis highlights DTPA's superior effectiveness in long text generation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes