ASCLLGSDMLOct 9, 2018

Federated Learning for Keyword Spotting

arXiv:1810.05512v4314 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This work addresses practical deployment challenges for smart home voice assistants, though it is incremental as it adapts existing methods to a specific domain.

The authors tackled the problem of out-of-domain issues in embedded speech models like wake word detectors by applying federated learning, demonstrating that an adaptive averaging strategy reduces communication rounds by 8 MB per user to reach target performance.

We propose a practical approach based on federated learning to solve out-of-domain issues with continuously running embedded speech-based models such as wake word detectors. We conduct an extensive empirical study of the federated averaging algorithm for the "Hey Snips" wake word based on a crowdsourced dataset that mimics a federation of wake word users. We empirically demonstrate that using an adaptive averaging strategy inspired from Adam in place of standard weighted model averaging highly reduces the number of communication rounds required to reach our target performance. The associated upstream communication costs per user are estimated at 8 MB, which is a reasonable in the context of smart home voice assistants. Additionally, the dataset used for these experiments is being open sourced with the aim of fostering further transparent research in the application of federated learning to speech data.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes