CLSep 15, 2021

Miðeind's WMT 2021 submission

arXiv:2109.07343v113 citations
Originality Synthesis-oriented
AI Analysis

Incremental improvement for machine translation in low-resource language pairs.

The paper tackled machine translation for English-Icelandic language pairs by iteratively generating backtranslations and adapting a pretrained mBART-25 model, achieving unspecified results without concrete numbers.

We present Miðeind's submission for the English$\to$Icelandic and Icelandic$\to$English subsets of the 2021 WMT news translation task. Transformer-base models are trained for translation on parallel data to generate backtranslations iteratively. A pretrained mBART-25 model is then adapted for translation using parallel data as well as the last backtranslation iteration. This adapted pretrained model is then used to re-generate backtranslations, and the training of the adapted model is continued.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes