SDAIASApr 22, 2025

TinyML for Speech Recognition

arXiv:2504.16213v17 citationsh-index: 1COMPSAC
Originality Synthesis-oriented
AI Analysis

This enables complex voice commands for IoT applications like smart homes and assisted living, but it is incremental as it builds on existing TinyML methods.

The paper tackles speech recognition on resource-constrained IoT edge devices by training a quantized 1D CNN model, achieving up to 97% accuracy on a new dataset with 23 keywords.

We train and deploy a quantized 1D convolutional neural network model to conduct speech recognition on a highly resource-constrained IoT edge device. This can be useful in various Internet of Things (IoT) applications, such as smart homes and ambient assisted living for the elderly and people with disabilities, just to name a few examples. In this paper, we first create a new dataset with over one hour of audio data that enables our research and will be useful to future studies in this field. Second, we utilize the technologies provided by Edge Impulse to enhance our model's performance and achieve a high Accuracy of up to 97% on our dataset. For the validation, we implement our prototype using the Arduino Nano 33 BLE Sense microcontroller board. This microcontroller board is specifically designed for IoT and AI applications, making it an ideal choice for our target use case scenarios. While most existing research focuses on a limited set of keywords, our model can process 23 different keywords, enabling complex commands.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes