CLNov 8, 2022

What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation

arXiv:2211.04052v2224 citationsh-index: 35
AI Analysis

This work addresses the interpretability and efficiency issues in domain adaptation for machine translation, though it is incremental as it builds on the existing kNN-MT paradigm.

The paper tackles the problem of large and redundant datastores in kNN-MT domain adaptation by investigating what knowledge neural machine translation models need, proposing local correctness to identify conditions where models fail; experiments on six domains and two language pairs show that pruning based on this approach creates a lighter and more explainable memory.

kNN-MT presents a new paradigm for domain adaptation by building an external datastore, which usually saves all target language token occurrences in the parallel corpus. As a result, the constructed datastore is usually large and possibly redundant. In this paper, we investigate the interpretability issue of this approach: what knowledge does the NMT model need? We propose the notion of local correctness (LAC) as a new angle, which describes the potential translation correctness for a single entry and for a given neighborhood. Empirical study shows that our investigation successfully finds the conditions where the NMT model could easily fail and need related knowledge. Experiments on six diverse target domains and two language-pairs show that pruning according to local correctness brings a light and more explainable memory for kNN-MT domain adaptation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes