CLMar 6, 2025

Assumed Identities: Quantifying Gender Bias in Machine Translation of Gender-Ambiguous Occupational Terms

arXiv:2503.04372v35 citationsh-index: 29EMNLP
Originality Incremental advance
AI Analysis

This addresses bias evaluation in machine translation for researchers and practitioners, but it is incremental as it builds on existing bias detection methods.

The paper tackled the problem of gender bias in machine translation of gender-ambiguous occupational terms by introducing GRAPE, a probability-based metric, and GAMBIT, a benchmarking dataset, and found that MT systems in Greek and French showed systematic biases aligning with societal stereotypes.

Machine Translation (MT) systems frequently encounter gender-ambiguous occupational terms, where they must assign gender without explicit contextual cues. While individual translations in such cases may not be inherently biased, systematic patterns-such as consistently translating certain professions with specific genders-can emerge, reflecting and perpetuating societal stereotypes. This ambiguity challenges traditional instance-level single-answer evaluation approaches, as no single gold standard translation exists. To address this, we introduce GRAPE, a probability-based metric designed to evaluate gender bias by analyzing aggregated model responses. Alongside this, we present GAMBIT, a benchmarking dataset in English with gender-ambiguous occupational terms. Using GRAPE, we evaluate several MT systems and examine whether their gendered translations in Greek and French align with or diverge from societal stereotypes, real-world occupational gender distributions, and normative standards

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes