CLDec 1, 2022

CUNI Systems for the WMT22 Czech-Ukrainian Translation Task

arXiv:2212.00486v18 citationsh-index: 28
Originality Synthesis-oriented
AI Analysis

This addresses translation needs for Czech-Ukrainian language pairs, particularly relevant for migration contexts, but is incremental as it builds on existing methods.

The paper tackled Czech-Ukrainian machine translation for the WMT22 task, presenting systems based on back-translation and romanization, with results showing romanization had only a minor effect on quality.

We present Charles University submissions to the WMT22 General Translation Shared Task on Czech-Ukrainian and Ukrainian-Czech machine translation. We present two constrained submissions based on block back-translation and tagged back-translation and experiment with rule-based romanization of Ukrainian. Our results show that the romanization only has a minor effect on the translation quality. Further, we describe Charles Translator, a system that was developed in March 2022 as a response to the migration from Ukraine to the Czech Republic. Compared to our constrained systems, it did not use the romanization and used some proprietary data sources.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes