CLJun 7, 2018

Characterizing Departures from Linearity in Word Translation

arXiv:1806.04508v232.11106 citations

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of improving word translation accuracy for NLP researchers, though it is incremental as it characterizes existing methods rather than introducing new ones.

The study investigated the non-linearity of word translation maps between languages by approximating them locally with linear maps, finding that these maps vary across the embedding space and correlate with neighborhood distances.

We investigate the behavior of maps learned by machine translation methods. The maps translate words by projecting between word embedding spaces of different languages. We locally approximate these maps using linear maps, and find that they vary across the word embedding space. This demonstrates that the underlying maps are non-linear. Importantly, we show that the locally linear maps vary by an amount that is tightly correlated with the distance between the neighborhoods on which they are trained. Our results can be used to test non-linear methods, and to drive the design of more accurate maps for word translation.

View on arXiv PDF

Similar