CLFeb 23, 2019

Augmenting Neural Machine Translation with Knowledge Graphs

arXiv:1902.08816v127 citations
Originality Incremental advance
AI Analysis

This addresses translation quality issues for entities and terminological expressions in machine translation systems, though it is incremental as it builds on existing neural architectures.

The paper tackles the problem of data sparsity and out-of-vocabulary words in neural machine translation by augmenting models with knowledge graphs, resulting in an average improvement of +3 BLEU, METEOR, and chrF3 on WMT English-German datasets.

While neural networks have been used extensively to make substantial progress in the machine translation task, they are known for being heavily dependent on the availability of large amounts of training data. Recent efforts have tried to alleviate the data sparsity problem by augmenting the training data using different strategies, such as back-translation. Along with the data scarcity, the out-of-vocabulary words, mostly entities and terminological expressions, pose a difficult challenge to Neural Machine Translation systems. In this paper, we hypothesize that knowledge graphs enhance the semantic feature extraction of neural models, thus optimizing the translation of entities and terminological expressions in texts and consequently leading to a better translation quality. We hence investigate two different strategies for incorporating knowledge graphs into neural models without modifying the neural network architectures. We also examine the effectiveness of our augmentation method to recurrent and non-recurrent (self-attentional) neural architectures. Our knowledge graph augmented neural translation model, dubbed KG-NMT, achieves significant and consistent improvements of +3 BLEU, METEOR and chrF3 on average on the newstest datasets between 2014 and 2018 for WMT English-German translation task.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes