CLAISep 28, 2021

Exploring Teacher-Student Learning Approach for Multi-lingual Speech-to-Intent Classification

arXiv:2109.13486v1
Originality Incremental advance
AI Analysis

This work addresses the challenge of multi-lingual intent classification for speech systems, but it is incremental as it builds on existing teacher-student and mBERT techniques.

The paper tackled the problem of multi-lingual speech-to-intent classification by using a teacher-student learning approach to transfer knowledge from a pre-trained mBERT model to a speech model, achieving an improved accuracy of 91.02% compared to 89.40% with a traditional method.

End-to-end speech-to-intent classification has shown its advantage in harvesting information from both text and speech. In this paper, we study a technique to develop such an end-to-end system that supports multiple languages. To overcome the scarcity of multi-lingual speech corpus, we exploit knowledge from a pre-trained multi-lingual natural language processing model. Multi-lingual bidirectional encoder representations from transformers (mBERT) models are trained on multiple languages and hence expected to perform well in the multi-lingual scenario. In this work, we employ a teacher-student learning approach to sufficiently extract information from an mBERT model to train a multi-lingual speech model. In particular, we use synthesized speech generated from an English-Mandarin text corpus for analysis and training of a multi-lingual intent classification model. We also demonstrate that the teacher-student learning approach obtains an improved performance (91.02%) over the traditional end-to-end (89.40%) intent classification approach in a practical multi-lingual scenario.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes