CLDec 30, 2021

QEMind: Alibaba's Submission to the WMT21 Quality Estimation Shared Task

arXiv:2112.14890v1649 citations
Originality Synthesis-oriented
AI Analysis

This work addresses quality control for machine translation users, but it is incremental as it builds on existing pre-trained models and features.

The paper tackles the problem of quality estimation for machine translation without reference translations, presenting QEMind, a system that uses XLM-Roberta and additional features to outperform the best system from the previous year's Direct Assessment task.

Quality Estimation, as a crucial step of quality control for machine translation, has been explored for years. The goal is to investigate automatic methods for estimating the quality of machine translation results without reference translations. In this year's WMT QE shared task, we utilize the large-scale XLM-Roberta pre-trained model and additionally propose several useful features to evaluate the uncertainty of the translations to build our QE system, named \textit{QEMind}. The system has been applied to the sentence-level scoring task of Direct Assessment and the binary score prediction task of Critical Error Detection. In this paper, we present our submissions to the WMT 2021 QE shared task and an extensive set of experimental results have shown us that our multilingual systems outperform the best system in the Direct Assessment QE task of WMT 2020.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes