CLApr 24, 2017

Found in Translation: Reconstructing Phylogenetic Language Trees from Translations

arXiv:1704.07146v149 citations
Originality Incremental advance
AI Analysis

This addresses a fundamental issue in computational linguistics and translation studies by revealing persistent linguistic traces, with incremental implications for language evolution analysis.

The study tackled the problem of detecting source language influence in translated texts by reconstructing phylogenetic language trees from monolingual translations, finding that source language interference remains dominant even after two translation phases, overshadowing universal translation properties.

Translation has played an important role in trade, law, commerce, politics, and literature for thousands of years. Translators have always tried to be invisible; ideal translations should look as if they were written originally in the target language. We show that traces of the source language remain in the translation product to the extent that it is possible to uncover the history of the source language by looking only at the translation. Specifically, we automatically reconstruct phylogenetic language trees from monolingual texts (translated from several source languages). The signal of the source language is so powerful that it is retained even after two phases of translation. This strongly indicates that source language interference is the most dominant characteristic of translated texts, overshadowing the more subtle signals of universal properties of translation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes