CRLGSDASFeb 11, 2022

On the Detection of Adaptive Adversarial Attacks in Speaker Verification Systems

arXiv:2202.05725v211 citations
Originality Incremental advance
AI Analysis

This addresses security vulnerabilities in speaker verification systems used in smart devices, though it is incremental as it builds on known attack methods.

The paper tackled the problem of detecting adversarial attacks like FAKEBOB in speaker verification systems by proposing a detector called MEH-FEST, which calculates minimum energy in high frequencies from audio, and achieved near zero false positive and false negative rates in experiments on GMM and i-vector systems.

Speaker verification systems have been widely used in smart phones and Internet of things devices to identify legitimate users. In recent work, it has been shown that adversarial attacks, such as FAKEBOB, can work effectively against speaker verification systems. The goal of this paper is to design a detector that can distinguish an original audio from an audio contaminated by adversarial attacks. Specifically, our designed detector, called MEH-FEST, calculates the minimum energy in high frequencies from the short-time Fourier transform of an audio and uses it as a detection metric. Through both analysis and experiments, we show that our proposed detector is easy to implement, fast to process an input audio, and effective in determining whether an audio is corrupted by FAKEBOB attacks. The experimental results indicate that the detector is extremely effective: with near zero false positive and false negative rates for detecting FAKEBOB attacks in Gaussian mixture model (GMM) and i-vector speaker verification systems. Moreover, adaptive adversarial attacks against our proposed detector and their countermeasures are discussed and studied, showing the game between attackers and defenders.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes