CL LGJul 17, 2021

Generative Pretraining for Paraphrase Evaluation

Jack Weston, Raphael Lenain, Udeepa Meepegama, Emil Fristed

arXiv:2107.08251v230.3638 citations

Originality Incremental advance

AI Analysis

This work addresses the problem of accurate and data-efficient paraphrase evaluation for researchers and practitioners in natural language processing, representing an incremental improvement over existing metrics.

The authors tackled the problem of evaluating text generation by introducing ParaBLEU, a paraphrase representation learning model and evaluation metric that uses generative conditioning as a pretraining objective. It achieved new state-of-the-art results on the 2017 WMT Metrics Shared Task, correlating more strongly with human judgments, and demonstrated robustness by exceeding previous SOTA with only 50% of training data and surpassing BLEU, ROUGE, and METEOR with only 40 labeled examples.

We introduce ParaBLEU, a paraphrase representation learning model and evaluation metric for text generation. Unlike previous approaches, ParaBLEU learns to understand paraphrasis using generative conditioning as a pretraining objective. ParaBLEU correlates more strongly with human judgements than existing metrics, obtaining new state-of-the-art results on the 2017 WMT Metrics Shared Task. We show that our model is robust to data scarcity, exceeding previous state-of-the-art performance using only $50\%$ of the available training data and surpassing BLEU, ROUGE and METEOR with only $40$ labelled examples. Finally, we demonstrate that ParaBLEU can be used to conditionally generate novel paraphrases from a single demonstration, which we use to confirm our hypothesis that it learns abstract, generalized paraphrase representations.

View on arXiv PDF

Similar