CLOct 12, 2020

Reformulating Unsupervised Style Transfer as Paraphrase Generation

arXiv:2010.05700v11051 citations
Originality Incremental advance
AI Analysis

This addresses the issue of meaning distortion in style transfer for NLP applications, offering a more reliable approach.

The paper tackles the problem of unsupervised style transfer by reformulating it as paraphrase generation, and demonstrates that a simple method based on fine-tuning pretrained language models on automatically generated paraphrase data significantly outperforms state-of-the-art systems in human and automatic evaluations.

Modern NLP defines the task of style transfer as modifying the style of a given sentence without appreciably changing its semantics, which implies that the outputs of style transfer systems should be paraphrases of their inputs. However, many existing systems purportedly designed for style transfer inherently warp the input's meaning through attribute transfer, which changes semantic properties such as sentiment. In this paper, we reformulate unsupervised style transfer as a paraphrase generation problem, and present a simple methodology based on fine-tuning pretrained language models on automatically generated paraphrase data. Despite its simplicity, our method significantly outperforms state-of-the-art style transfer systems on both human and automatic evaluations. We also survey 23 style transfer papers and discover that existing automatic metrics can be easily gamed and propose fixed variants. Finally, we pivot to a more real-world style transfer setting by collecting a large dataset of 15M sentences in 11 diverse styles, which we use for an in-depth analysis of our system.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes