CLApr 24, 2024

Effective Unsupervised Constrained Text Generation based on Perturbed Masking

arXiv:2404.15877v1638 citationsh-index: 44Findings
Originality Incremental advance
AI Analysis

This addresses the challenge of generating text under constraints without supervised data, offering an incremental improvement over existing methods.

The paper tackles the problem of unsupervised constrained text generation by proposing PMCTG, which searches for the best edit position and action to reduce unnecessary steps, achieving new state-of-the-art results in keywords-to-sentence generation and paraphrasing tasks.

Unsupervised constrained text generation aims to generate text under a given set of constraints without any supervised data. Current state-of-the-art methods stochastically sample edit positions and actions, which may cause unnecessary search steps. In this paper, we propose PMCTG to improve effectiveness by searching for the best edit position and action in each step. Specifically, PMCTG extends perturbed masking technique to effectively search for the most incongruent token to edit. Then it introduces four multi-aspect scoring functions to select edit action to further reduce search difficulty. Since PMCTG does not require supervised data, it could be applied to different generation tasks. We show that under the unsupervised setting, PMCTG achieves new state-of-the-art results in two representative tasks, namely keywords-to-sentence generation and paraphrasing.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes