CLOct 20, 2023

Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification

arXiv:2310.13661v1142 citationsh-index: 6
Originality Synthesis-oriented
AI Analysis

This addresses a domain-specific issue for researchers and practitioners in Arabic NLP by highlighting incremental improvements in dataset design and task framing.

The paper tackles the problem of Arabic Dialect Identification (ADI) systems failing to distinguish micro-dialects by arguing that single-label classification is a key limitation, and it found that approximately 66% of validated errors in predictions are not true errors.

Automatic Arabic Dialect Identification (ADI) of text has gained great popularity since it was introduced in the early 2010s. Multiple datasets were developed, and yearly shared tasks have been running since 2018. However, ADI systems are reported to fail in distinguishing between the micro-dialects of Arabic. We argue that the currently adopted framing of the ADI task as a single-label classification problem is one of the main reasons for that. We highlight the limitation of the incompleteness of the Dialect labels and demonstrate how it impacts the evaluation of ADI systems. A manual error analysis for the predictions of an ADI, performed by 7 native speakers of different Arabic dialects, revealed that $\approx$ 66% of the validated errors are not true errors. Consequently, we propose framing ADI as a multi-label classification task and give recommendations for designing new ADI datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes