CLDec 13, 2021

English2Gbe: A multilingual machine translation model for {Fon/Ewe}Gbe

arXiv:2112.11482v15 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of limited multilingual translation adoption for low-resourced languages, benefiting communities by providing a more efficient and effective translation tool.

The paper tackles the challenge of low-resourced African languages by developing English2Gbe, a multilingual machine translation model that translates from English to Ewe or Fon, outperforming bilingual models and achieving state-of-the-art results on the JW300 benchmark for Fon.

Language is an essential factor of emancipation. Unfortunately, most of the more than 2,000 African languages are low-resourced. The community has recently used machine translation to revive and strengthen several African languages. However, the trained models are often bilingual, resulting in a potentially exponential number of models to train and maintain to cover all possible translation directions. Additionally, bilingual models do not leverage the similarity between some of the languages. Consequently, multilingual neural machine translation (NMT) is gaining considerable interest, especially for low-resourced languages. Nevertheless, its adoption by the community is still limited. This paper introduces English2Gbe, a multilingual NMT model capable of translating from English to Ewe or Fon. Using the BLEU, CHRF, and TER scores computed with the Sacrebleu (Post, 2018) package for reproducibility, we show that English2Gbe outperforms bilingual models (English to Ewe and English to Fon) and gives state-of-the-art results on the JW300 benchmark for Fon established by Nekoto et al. (2020). We hope this work will contribute to the massive adoption of Multilingual models inside the community. Our code is made accessible from Github.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes