CLOct 23, 2023

Rethinking Word-Level Auto-Completion in Computer-Aided Translation

Tencent
arXiv:2310.14523v2131 citationsh-index: 27
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in translation tools for human translators, offering an incremental improvement over existing methods.

The paper tackled the problem of word-level auto-completion in computer-assisted translation by rethinking what makes good auto-completions, introducing a measurable criterion, and proposing a general approach that outperforms the top system in WMT2022 shared tasks with smaller model sizes.

Word-Level Auto-Completion (WLAC) plays a crucial role in Computer-Assisted Translation. It aims at providing word-level auto-completion suggestions for human translators. While previous studies have primarily focused on designing complex model architectures, this paper takes a different perspective by rethinking the fundamental question: what kind of words are good auto-completions? We introduce a measurable criterion to answer this question and discover that existing WLAC models often fail to meet this criterion. Building upon this observation, we propose an effective approach to enhance WLAC performance by promoting adherence to the criterion. Notably, the proposed approach is general and can be applied to various encoder-based architectures. Through extensive experiments, we demonstrate that our approach outperforms the top-performing system submitted to the WLAC shared tasks in WMT2022, while utilizing significantly smaller model sizes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes