CLAIMar 22, 2022

Learning Confidence for Transformer-based Neural Machine Translation

arXiv:2203.11413v1644 citationsh-index: 39
Originality Incremental advance
AI Analysis

This work addresses the problem of unreliable confidence estimates in NMT for real-world applications, offering an incremental improvement through a novel learning approach.

The paper tackles the challenge of confidence estimation in neural machine translation by proposing an unsupervised method that learns confidence jointly with the model training, achieving high accuracy on quality estimation tasks and enabling effective detection of noisy samples and out-of-domain data.

Confidence estimation aims to quantify the confidence of the model prediction, providing an expectation of success. A well-calibrated confidence estimate enables accurate failure prediction and proper risk measurement when given noisy samples and out-of-distribution data in real-world settings. However, this task remains a severe challenge for neural machine translation (NMT), where probabilities from softmax distribution fail to describe when the model is probably mistaken. To address this problem, we propose an unsupervised confidence estimate learning jointly with the training of the NMT model. We explain confidence as how many hints the NMT model needs to make a correct prediction, and more hints indicate low confidence. Specifically, the NMT model is given the option to ask for hints to improve translation accuracy at the cost of some slight penalty. Then, we approximate their level of confidence by counting the number of hints the model uses. We demonstrate that our learned confidence estimate achieves high accuracy on extensive sentence/word-level quality estimation tasks. Analytical results verify that our confidence estimate can correctly assess underlying risk in two real-world scenarios: (1) discovering noisy samples and (2) detecting out-of-domain data. We further propose a novel confidence-based instance-specific label smoothing approach based on our learned confidence estimate, which outperforms standard label smoothing.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes