New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark
This work provides an incremental enhancement for the French research community in spoken language understanding by extending an existing dataset to support more tasks.
The authors addressed the lack of intent annotations in the French MEDIA SLU dataset by building an enhanced version with a semi-automatic methodology, and they reported first results from joint models for intent classification and slot-filling on this new dataset.
Intent classification and slot-filling are essential tasks of Spoken Language Understanding (SLU). In most SLU systems, those tasks are realized by independent modules. For about fifteen years, models achieving both of them jointly and exploiting their mutual enhancement have been proposed. A multilingual module using a joint model was envisioned to create a touristic dialogue system for a European project, HumanE-AI-Net. A combination of multiple datasets, including the MEDIA dataset, was suggested for training this joint model. The MEDIA SLU dataset is a French dataset distributed since 2005 by ELRA, mainly used by the French research community and free for academic research since 2020. Unfortunately, it is annotated with slots only, not with intents. An enhanced version of MEDIA annotated with intents has been built to extend its use to more tasks and use cases. This paper presents the semi-automatic methodology used to obtain this enhanced version. In addition, we present the first results of SLU experiments on this enhanced dataset using joint models for intent classification and slot-filling.
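To make the two tasks concrete, the sketch below shows what a jointly annotated utterance could look like: one slot tag per token plus a single intent label for the whole utterance. It is only an illustration under stated assumptions: the class `AnnotatedUtterance`, the BIO tagging scheme, and the label names `object`, `city`, and `booking` are hypothetical and are not taken from the MEDIA annotation scheme.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class AnnotatedUtterance:
    """Hypothetical container for one utterance with slot and intent labels."""
    tokens: List[str]
    slot_tags: List[str]  # one BIO tag per token (assumed scheme)
    intent: str           # one label for the whole utterance

    def slots(self) -> List[Tuple[str, str]]:
        """Group BIO tags into (slot_name, text) pairs.

        An "I-" tag that does not continue the currently open slot is
        treated like "O" and its token is dropped.
        """
        spans: List[Tuple[str, str]] = []
        name, words = None, []
        for token, tag in zip(self.tokens, self.slot_tags):
            if tag.startswith("B-"):
                # A new slot starts: close the open one, if any.
                if name is not None:
                    spans.append((name, " ".join(words)))
                name, words = tag[2:], [token]
            elif tag.startswith("I-") and name == tag[2:]:
                # Continuation of the open slot.
                words.append(token)
            else:
                # Outside any slot: close the open one, if any.
                if name is not None:
                    spans.append((name, " ".join(words)))
                name, words = None, []
        if name is not None:
            spans.append((name, " ".join(words)))
        return spans


# Illustrative example; labels are invented, not actual MEDIA labels.
example = AnnotatedUtterance(
    tokens=["je", "voudrais", "réserver", "un", "hôtel", "à", "Paris"],
    slot_tags=["O", "O", "O", "O", "B-object", "O", "B-city"],
    intent="booking",
)
print(example.intent, example.slots())
```

A slot-only dataset provides `tokens` and `slot_tags`; adding intent annotations amounts to filling in the `intent` field for each utterance, which is what allows a joint model to be trained on both targets.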