CLASApr 4, 2021

Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

arXiv:2104.01604v216 citationsHas Code
AI Analysis

This addresses a gap in datasets for voice control applications, but it is incremental as it focuses on a specific domain without broad methodological innovation.

The paper tackles the lack of spoken language understanding datasets for voice commands involving numbers by introducing Timers and Such, an open-source dataset of spoken English commands, and reports experiments with baseline models using ASR-based and end-to-end approaches.

This paper introduces Timers and Such, a new open source dataset of spoken English commands for common voice control use cases involving numbers. We describe the gap in existing spoken language understanding datasets that Timers and Such fills, the design and creation of the dataset, and experiments with a number of ASR-based and end-to-end baseline models, the code for which has been made available as part of the SpeechBrain toolkit.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes