CLNov 13, 2025

ELYADATA & LIA at NADI 2025: ASR and ADI Subtasks

Haroun Elleuch, Youssef Saidi, Salima Mdhaffar, Yannick Estève, Fethi Bougares

arXiv:2511.10090v14 citationsh-index: 19Proceedings of The Third Arabic Natural Language Processing Conference: Shared Tasks

Originality Synthesis-oriented

AI Analysis

This work addresses dialect-specific challenges in Arabic speech processing, but it is incremental as it applies fine-tuning of existing models to new tasks.

The paper tackled the problem of spoken Arabic dialect identification and multi-dialectal Arabic automatic speech recognition, achieving first place in ADI with 79.83% accuracy and second place in ASR with average WER of 38.54% and CER of 14.53%.

This paper describes Elyadata \& LIA's joint submission to the NADI multi-dialectal Arabic Speech Processing 2025. We participated in the Spoken Arabic Dialect Identification (ADI) and multi-dialectal Arabic ASR subtasks. Our submission ranked first for the ADI subtask and second for the multi-dialectal Arabic ASR subtask among all participants. Our ADI system is a fine-tuned Whisper-large-v3 encoder with data augmentation. This system obtained the highest ADI accuracy score of \textbf{79.83\%} on the official test set. For multi-dialectal Arabic ASR, we fine-tuned SeamlessM4T-v2 Large (Egyptian variant) separately for each of the eight considered dialects. Overall, we obtained an average WER and CER of \textbf{38.54\%} and \textbf{14.53\%}, respectively, on the test set. Our results demonstrate the effectiveness of large pre-trained speech models with targeted fine-tuning for Arabic speech processing.

View on arXiv PDF

Similar