CLJun 14, 2024

Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness

arXiv:2406.09977v126 citations
Originality Incremental advance
AI Analysis

This work addresses fairness issues in NLP for dialect speakers, focusing on mitigating bias in language detection tasks, though it is incremental as it builds on existing multitask learning methods.

The paper tackled unfair NLP behavior towards dialect speakers by investigating performance disparities in biased language detection across dialects and proposed a multitask learning approach to mitigate these disparities. The results showed that complementing common learning approaches with dialect modeling improved fairness and achieved state-of-the-art performance in detecting biased language.

Dialects introduce syntactic and lexical variations in language that occur in regional or social groups. Most NLP methods are not sensitive to such variations. This may lead to unfair behavior of the methods, conveying negative bias towards dialect speakers. While previous work has studied dialect-related fairness for aspects like hate speech, other aspects of biased language, such as lewdness, remain fully unexplored. To fill this gap, we investigate performance disparities between dialects in the detection of five aspects of biased language and how to mitigate them. To alleviate bias, we present a multitask learning approach that models dialect language as an auxiliary task to incorporate syntactic and lexical variations. In our experiments with African-American English dialect, we provide empirical evidence that complementing common learning approaches with dialect modeling improves their fairness. Furthermore, the results suggest that multitask learning achieves state-of-the-art performance and helps to detect properties of biased language more reliably.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes