CLMar 9, 2022

Onception: Active Learning with Expert Advice for Real World Machine Translation

Vânia Mendonça, Ricardo Rei, Luisa Coheur, Alberto Sardinha

arXiv:2203.04507v221.6227 citationsh-index: 18Has Code

Originality Incremental advance

AI Analysis

This work addresses the problem of reducing annotation costs for machine translation in low-resource, streaming settings, though it is incremental as it builds on existing active learning and expert advice methods.

The paper tackles the challenge of active learning for machine translation in a real-world streaming scenario where human feedback is limited to ratings rather than full translations, and shows that combining multiple query strategies with expert advice reduces the number of human interactions needed to converge to the best translation systems.

Active learning can play an important role in low-resource settings (i.e., where annotated data is scarce), by selecting which instances may be more worthy to annotate. Most active learning approaches for Machine Translation assume the existence of a pool of sentences in a source language, and rely on human annotators to provide translations or post-edits, which can still be costly. In this article, we assume a real world human-in-the-loop scenario in which: (i) the source sentences may not be readily available, but instead arrive in a stream; (ii) the automatic translations receive feedback in the form of a rating, instead of a correct/edited translation, since the human-in-the-loop might be a user looking for a translation, but not be able to provide one. To tackle the challenge of deciding whether each incoming pair source-translations is worthy to query for human feedback, we resort to a number of stream-based active learning query strategies. Moreover, since we not know in advance which query strategy will be the most adequate for a certain language pair and set of Machine Translation models, we propose to dynamically combine multiple strategies using prediction with expert advice. Our experiments show that using active learning allows to converge to the best Machine Translation systems with fewer human interactions. Furthermore, combining multiple strategies using prediction with expert advice often outperforms several individual active learning strategies with even fewer interactions.

View on arXiv PDF Code

Similar