CLAIFeb 22, 2024

GATE X-E : A Challenge Set for Gender-Fair Translations from Weakly-Gendered Languages

arXiv:2402.14277v128 citationsh-index: 6Has CodeACL
Originality Synthesis-oriented
AI Analysis

This addresses the problem of gender bias in translation for researchers and practitioners by providing a benchmark, but it is incremental as it extends an existing corpus.

The paper tackles the lack of benchmarks for evaluating gender bias in neural machine translation from weakly-gendered languages to English by introducing GATE X-E, a dataset with human translations from four languages, each with feminine, masculine, and neutral variants, containing 1250 to 1850 instances per language pair, and they use it to evaluate a GPT-4-based translation gender rewriting solution.

Neural Machine Translation (NMT) continues to improve in quality and adoption, yet the inadvertent perpetuation of gender bias remains a significant concern. Despite numerous studies on gender bias in translations into English from weakly gendered-languages, there are no benchmarks for evaluating this phenomenon or for assessing mitigation strategies. To address this gap, we introduce GATE X-E, an extension to the GATE (Rarrick et al., 2023) corpus, that consists of human translations from Turkish, Hungarian, Finnish, and Persian into English. Each translation is accompanied by feminine, masculine, and neutral variants. The dataset, which contains between 1250 and 1850 instances for each of the four language pairs, features natural sentences with a wide range of sentence lengths and domains, challenging translation rewriters on various linguistic phenomena. Additionally, we present a translation gender rewriting solution built with GPT-4 and use GATE X-E to evaluate it. We open source our contributions to encourage further research on gender debiasing.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes