CLLGJul 17, 2021

Generative Pretraining for Paraphrase Evaluation

arXiv:2107.08251v2638 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of accurate and data-efficient paraphrase evaluation for researchers and practitioners in natural language processing, representing an incremental improvement over existing metrics.

The authors tackled the problem of evaluating text generation by introducing ParaBLEU, a paraphrase representation learning model and evaluation metric that uses generative conditioning as a pretraining objective. It achieved new state-of-the-art results on the 2017 WMT Metrics Shared Task, correlating more strongly with human judgments, and demonstrated robustness by exceeding previous SOTA with only 50% of training data and surpassing BLEU, ROUGE, and METEOR with only 40 labeled examples.

We introduce ParaBLEU, a paraphrase representation learning model and evaluation metric for text generation. Unlike previous approaches, ParaBLEU learns to understand paraphrasis using generative conditioning as a pretraining objective. ParaBLEU correlates more strongly with human judgements than existing metrics, obtaining new state-of-the-art results on the 2017 WMT Metrics Shared Task. We show that our model is robust to data scarcity, exceeding previous state-of-the-art performance using only $50\%$ of the available training data and surpassing BLEU, ROUGE and METEOR with only $40$ labelled examples. Finally, we demonstrate that ParaBLEU can be used to conditionally generate novel paraphrases from a single demonstration, which we use to confirm our hypothesis that it learns abstract, generalized paraphrase representations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes