CLMay 6, 2021

Quantitative Evaluation of Alternative Translations in a Corpus of Highly Dissimilar Finnish Paraphrases

arXiv:2105.02477v1726 citations
AI Analysis

This addresses the challenge of translation variation analysis for NLP researchers, but it is incremental as it applies existing methods to a new dataset.

The paper tackled the problem of evaluating differences between alternative translations in a Finnish paraphrase corpus, finding that it contains highly non-trivial variants difficult to detect automatically.

In this paper, we present a quantitative evaluation of differences between alternative translations in a large recently released Finnish paraphrase corpus focusing in particular on non-trivial variation in translation. We combine a series of automatic steps detecting systematic variation with manual analysis to reveal regularities and identify categories of translation differences. We find the paraphrase corpus to contain highly non-trivial translation variants difficult to recognize through automatic approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes