CLJun 14, 2024

Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness

Maximilian Spliethöver, Sai Nikhil Menon, Henning Wachsmuth

arXiv:2406.09977v114.126 citations

Originality Incremental advance

AI Analysis

This work addresses fairness issues in NLP for dialect speakers, focusing on mitigating bias in language detection tasks, though it is incremental as it builds on existing multitask learning methods.

The paper tackled unfair NLP behavior towards dialect speakers by investigating performance disparities in biased language detection across dialects and proposed a multitask learning approach to mitigate these disparities. The results showed that complementing common learning approaches with dialect modeling improved fairness and achieved state-of-the-art performance in detecting biased language.

Dialects introduce syntactic and lexical variations in language that occur in regional or social groups. Most NLP methods are not sensitive to such variations. This may lead to unfair behavior of the methods, conveying negative bias towards dialect speakers. While previous work has studied dialect-related fairness for aspects like hate speech, other aspects of biased language, such as lewdness, remain fully unexplored. To fill this gap, we investigate performance disparities between dialects in the detection of five aspects of biased language and how to mitigate them. To alleviate bias, we present a multitask learning approach that models dialect language as an auxiliary task to incorporate syntactic and lexical variations. In our experiments with African-American English dialect, we provide empirical evidence that complementing common learning approaches with dialect modeling improves their fairness. Furthermore, the results suggest that multitask learning achieves state-of-the-art performance and helps to detect properties of biased language more reliably.

View on arXiv PDF

Similar