CL AIFeb 17, 2021

Sparsely Factored Neural Machine Translation

Noe Casas, Jose A. R. Fonollosa, Marta R. Costa-jussà

arXiv:2102.08934v10.2Has Code

Originality Incremental advance

AI Analysis

This addresses the challenge of handling non-dense linguistic features in machine translation for low-resource, morphologically-rich languages, representing an incremental improvement over standard methods.

The paper tackled the problem of incorporating sparse linguistic annotations into neural machine translation by proposing a new method that improves out-of-domain performance with large gains while maintaining comparable in-domain quality, specifically tested on morphologically-rich languages like Basque and German in low-resource settings.

The standard approach to incorporate linguistic information to neural machine translation systems consists in maintaining separate vocabularies for each of the annotated features to be incorporated (e.g. POS tags, dependency relation label), embed them, and then aggregate them with each subword in the word they belong to. This approach, however, cannot easily accommodate annotation schemes that are not dense for every word. We propose a method suited for such a case, showing large improvements in out-of-domain data, and comparable quality for the in-domain data. Experiments are performed in morphologically-rich languages like Basque and German, for the case of low-resource scenarios.

View on arXiv PDF Code

Similar