AS CL SDJun 4, 2020

AP20-OLR Challenge: Three Tasks and Their Baselines

Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Liming Song, Cheng Yang

arXiv:2006.03473v48.640 citations

Originality Synthesis-oriented

AI Analysis

This work addresses practical challenges in language recognition for researchers and developers, but it is incremental as it builds on previous OLR challenges with new data and tasks.

The paper introduces the AP20-OLR challenge, which aims to improve language recognition systems by focusing on three practical tasks: cross-channel language identification, dialect identification, and noisy language identification, with baseline results showing these tasks require further effort for better performance.

This paper introduces the fifth oriental language recognition (OLR) challenge AP20-OLR, which intends to improve the performance of language recognition systems, along with APSIPA Annual Summit and Conference (APSIPA ASC). The data profile, three tasks, the corresponding baselines, and the evaluation principles are introduced in this paper. The AP20-OLR challenge includes more languages, dialects and real-life data provided by Speechocean and the NSFC M2ASR project, and all the data is free for participants. The challenge this year still focuses on practical and challenging problems, with three tasks: (1) cross-channel LID, (2) dialect identification and (3) noisy LID. Based on Kaldi and Pytorch, recipes for i-vector and x-vector systems are also conducted as baselines for the three tasks. These recipes will be online-published, and available for participants to configure LID systems. The baseline results on the three tasks demonstrate that those tasks in this challenge are worth paying more efforts to achieve better performance.

View on arXiv PDF

Similar