CLJan 2, 2021

The Highs and Lows of Simple Lexical Domain Adaptation Approaches for Neural Machine Translation

arXiv:2101.00421v3662 citations
Originality Synthesis-oriented
AI Analysis

This work provides an incremental solution for improving neural machine translation quality for users in low-resource, out-of-domain scenarios.

This paper addresses the problem of poor quality and hallucinations in low-resource out-of-domain neural machine translation by adopting two computationally cheap lexical domain adaptation approaches: lexical shortlisting restricted by IBM statistical alignments and hypothesis re-ranking based on similarity. The methods show success on low-resource out-of-domain test sets but are ineffective with sufficient data or large domain mismatch.

Machine translation systems are vulnerable to domain mismatch, especially in a low-resource scenario. Out-of-domain translations are often of poor quality and prone to hallucinations, due to exposure bias and the decoder acting as a language model. We adopt two approaches to alleviate this problem: lexical shortlisting restricted by IBM statistical alignments, and hypothesis re-ranking based on similarity. The methods are computationally cheap, widely known, but not extensively experimented on domain adaptation. We demonstrate success on low-resource out-of-domain test sets, however, the methods are ineffective when there is sufficient data or too great domain mismatch. This is due to both the IBM model losing its advantage over the implicitly learned neural alignment, and issues with subword segmentation of out-of-domain words.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes