Almawave-SLU: A new dataset for SLU in Italian
The paper provides a new dataset for researchers and developers working on Italian conversational AI, but the contribution is incremental: it applies an existing method to new data.
The authors address the lack of labeled data for Spoken Language Understanding (SLU) in Italian by creating the first Italian SLU dataset and using it to benchmark several open source and commercial systems.
The widespread use of conversational and question answering systems has made it necessary to improve the performance of speaker intent detection and of the understanding of the related semantic slots, i.e., Spoken Language Understanding (SLU). These tasks are often approached with supervised learning methods, which need large labeled datasets. This paper presents the first Italian dataset for SLU. It is derived through a semi-automatic procedure and is used as a benchmark for various open source and commercial systems.
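To make the two SLU subtasks concrete, the following is a minimal sketch of how a single labeled utterance might be represented: one intent label for the whole sentence plus one slot tag per token. The BIO tagging scheme, the Italian sentence, the intent name `BookRestaurant`, and the slot names `party_size` and `city` are all assumptions chosen for illustration; they are not taken from the dataset itself.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class SLUExample:
    """One labeled utterance: a sentence-level intent and per-token slot tags."""
    tokens: List[str]
    intent: str
    slot_tags: List[str]  # BIO scheme (assumed): "B-x", "I-x", or "O"


def extract_slots(example: SLUExample) -> List[Tuple[str, str]]:
    """Collect (slot_label, text) pairs from the BIO tags of an example."""
    slots: List[Tuple[str, str]] = []
    label = None
    words: List[str] = []
    for token, tag in zip(example.tokens, example.slot_tags):
        if tag.startswith("B-"):
            # A new slot begins; close any slot that was still open.
            if label is not None:
                slots.append((label, " ".join(words)))
            label, words = tag[2:], [token]
        elif tag.startswith("I-") and label == tag[2:]:
            # Continuation of the currently open slot.
            words.append(token)
        else:
            # "O" (or an I- tag with no matching open slot) closes the slot.
            if label is not None:
                slots.append((label, " ".join(words)))
            label, words = None, []
    if label is not None:
        slots.append((label, " ".join(words)))
    return slots


# Hypothetical example: "book a table for two in Rome".
example = SLUExample(
    tokens=["prenota", "un", "tavolo", "per", "due", "a", "Roma"],
    intent="BookRestaurant",
    slot_tags=["O", "O", "O", "O", "B-party_size", "O", "B-city"],
)

print(example.intent)
print(extract_slots(example))
```

A supervised SLU system is trained to predict both the `intent` field and the `slot_tags` sequence from `tokens` alone, which is why datasets of this kind need annotations at both levels.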