CLDec 10, 2024

Filipino Benchmarks for Measuring Sexist and Homophobic Bias in Multilingual Language Models from Southeast Asia

arXiv:2412.07303v213.520 citationsh-index: 2Has Code

Originality Synthesis-oriented

AI Analysis

This addresses bias measurement in low-resource languages for researchers and practitioners in NLP, though it is incremental as it adapts existing methods to a new context.

The researchers tackled the problem of measuring sexist and homophobic bias in multilingual language models for Filipino, a low-resource language, by introducing two new benchmarks with 7,074 challenge pairs, and found that these models contain considerable bias influenced by pretraining data exposure.

Bias studies on multilingual models confirm the presence of gender-related stereotypes in masked models processing languages with high NLP resources. We expand on this line of research by introducing Filipino CrowS-Pairs and Filipino WinoQueer: benchmarks that assess both sexist and anti-queer biases in pretrained language models (PLMs) handling texts in Filipino, a low-resource language from the Philippines. The benchmarks consist of 7,074 new challenge pairs resulting from our cultural adaptation of English bias evaluation datasets, a process that we document in detail to guide similar forthcoming efforts. We apply the Filipino benchmarks on masked and causal multilingual models, including those pretrained on Southeast Asian data, and find that they contain considerable amounts of bias. We also find that for multilingual models, the extent of bias learned for a particular language is influenced by how much pretraining data in that language a model was exposed to. Our benchmarks and insights can serve as a foundation for future work analyzing and mitigating bias in multilingual models.

View on arXiv PDF Code

Similar