ASSDOct 15, 2020

Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark

arXiv:2010.07913v134 citations
Originality Synthesis-oriented
AI Analysis

This addresses dataset biases in speaker verification spoofing detection, offering incremental improvements for researchers in speech biometric security.

The study identified that artefacts in the ASVspoof 2017 dataset may artificially inflate the success of anti-spoofing systems, and proposed discarding nonspeech segments to improve robustness, providing new benchmark results.

The Automatic Speaker Verification Spoofing and Countermeasures Challenges motivate research in protecting speech biometric systems against a variety of different access attacks. The 2017 edition focused on replay spoofing attacks, and involved participants building and training systems on a provided dataset (ASVspoof 2017). More than 60 research papers have so far been published with this dataset, but none have sought to answer why countermeasures appear successful in detecting spoofing attacks. This article shows how artefacts inherent to the dataset may be contributing to the apparent success of published systems. We first inspect the ASVspoof 2017 dataset and summarize various artefacts present in the dataset. Second, we demonstrate how countermeasure models can exploit these artefacts to appear successful in this dataset. Third, for reliable and robust performance estimates on this dataset we propose discarding nonspeech segments and silence before and after the speech utterance during training and inference. We create speech start and endpoint annotations in the dataset and demonstrate how using them helps countermeasure models become less vulnerable from being manipulated using artefacts found in the dataset. Finally, we provide several new benchmark results for both frame-level and utterance-level models that can serve as new baselines on this dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes