AP18-OLR Challenge: Three Tasks and Their Baselines
This work provides a benchmark for researchers in speech recognition to tackle more realistic and difficult language identification problems, but it is incremental as it builds on previous annual challenges.
The paper introduces the AP18-OLR challenge, focusing on three challenging tasks in oriental language recognition—short-duration utterances, confusing languages, and open-set recognition—and reports baseline results using i-vector and neural network models to demonstrate the difficulty of these tasks.
The third oriental language recognition (OLR) challenge AP18-OLR is introduced in this paper, including the data profile, the tasks and the evaluation principles. Following the events in the last two years, namely AP16-OLR and AP17-OLR, the challenge this year focuses on more challenging tasks, including (1) short-duration utterances, (2) confusing languages, and (3) open-set recognition. The same as the previous events, the data of AP18-OLR is also provided by SpeechOcean and the NSFC M2ASR project. Baselines based on both the i-vector model and neural networks are constructed for the participants' reference. We report the baseline results on the three tasks and demonstrate that the three tasks are truly challenging. All the data is free for participants, and the Kaldi recipes for the baselines have been published online.