CLOct 23, 2023

Rethinking Word-Level Auto-Completion in Computer-Aided Translation

Xingyu Chen, Lemao Liu, Guoping Huang, Zhirui Zhang, Mingming Yang, Shuming Shi, Rui Wang

Tencent

arXiv:2310.14523v221.0131 citationsh-index: 27Has Code

Originality Incremental advance

AI Analysis

This work addresses a specific bottleneck in translation tools for human translators, offering an incremental improvement over existing methods.

The paper tackled the problem of word-level auto-completion in computer-assisted translation by rethinking what makes good auto-completions, introducing a measurable criterion, and proposing a general approach that outperforms the top system in WMT2022 shared tasks with smaller model sizes.

Word-Level Auto-Completion (WLAC) plays a crucial role in Computer-Assisted Translation. It aims at providing word-level auto-completion suggestions for human translators. While previous studies have primarily focused on designing complex model architectures, this paper takes a different perspective by rethinking the fundamental question: what kind of words are good auto-completions? We introduce a measurable criterion to answer this question and discover that existing WLAC models often fail to meet this criterion. Building upon this observation, we propose an effective approach to enhance WLAC performance by promoting adherence to the criterion. Notably, the proposed approach is general and can be applied to various encoder-based architectures. Through extensive experiments, we demonstrate that our approach outperforms the top-performing system submitted to the WLAC shared tasks in WMT2022, while utilizing significantly smaller model sizes.

View on arXiv PDF Code

Similar