AS CR LG SDAug 17, 2024

Malacopula: adversarial automatic speaker verification attacks using a neural-based generalised Hammerstein model

Massimiliano Todisco, Michele Panariello, Xin Wang, Héctor Delgado, Kong Aik Lee, Nicholas Evans

arXiv:2408.09300v18.018 citationsh-index: 24Has Code

Originality Incremental advance

AI Analysis

This work addresses security vulnerabilities in ASV systems, though it is incremental as it builds on existing adversarial attack methods for speech.

The authors tackled the problem of deceiving automatic speaker verification (ASV) systems by developing Malacopula, a neural-based model that introduces adversarial perturbations to spoofed speech, resulting in increased vulnerabilities by a substantial margin in experiments with three ASV systems.

We present Malacopula, a neural-based generalised Hammerstein model designed to introduce adversarial perturbations to spoofed speech utterances so that they better deceive automatic speaker verification (ASV) systems. Using non-linear processes to modify speech utterances, Malacopula enhances the effectiveness of spoofing attacks. The model comprises parallel branches of polynomial functions followed by linear time-invariant filters. The adversarial optimisation procedure acts to minimise the cosine distance between speaker embeddings extracted from spoofed and bona fide utterances. Experiments, performed using three recent ASV systems and the ASVspoof 2019 dataset, show that Malacopula increases vulnerabilities by a substantial margin. However, speech quality is reduced and attacks can be detected effectively under controlled conditions. The findings emphasise the need to identify new vulnerabilities and design defences to protect ASV systems from adversarial attacks in the wild.

View on arXiv PDF Code

Similar