ASCRSDMar 31, 2021

Adversarial Attacks and Defenses for Speech Recognition Systems

arXiv:2103.17122v140 citations
Originality Incremental advance
AI Analysis

This work addresses security vulnerabilities in speech recognition systems, which are critical for applications like voice assistants, but it is incremental as it builds on existing adversarial attack and defense methods.

The paper investigates adversarial attacks and defenses for automatic speech recognition systems, finding limited attack transferability between models and showing that preprocessing defenses like randomized smoothing and a WaveGAN vocoder significantly improve robustness, with the vocoder maintaining high word error rates for targeted phrases even under joint attack.

The ubiquitous presence of machine learning systems in our lives necessitates research into their vulnerabilities and appropriate countermeasures. In particular, we investigate the effectiveness of adversarial attacks and defenses against automatic speech recognition (ASR) systems. We select two ASR models - a thoroughly studied DeepSpeech model and a more recent Espresso framework Transformer encoder-decoder model. We investigate two threat models: a denial-of-service scenario where fast gradient-sign method (FGSM) or weak projected gradient descent (PGD) attacks are used to degrade the model's word error rate (WER); and a targeted scenario where a more potent imperceptible attack forces the system to recognize a specific phrase. We find that the attack transferability across the investigated ASR systems is limited. To defend the model, we use two preprocessing defenses: randomized smoothing and WaveGAN-based vocoder, and find that they significantly improve the model's adversarial robustness. We show that a WaveGAN vocoder can be a useful countermeasure to adversarial attacks on ASR systems - even when it is jointly attacked with the ASR, the target phrases' word error rate is high.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes