SDCRASDec 26, 2018

A Multiversion Programming Inspired Approach to Detecting Audio Adversarial Examples

arXiv:1812.10199v252 citations
Originality Incremental advance
AI Analysis

This addresses the threat of audio adversarial examples to machine learning trustworthiness, particularly for ASR systems, though it is an incremental improvement leveraging existing methods.

The paper tackles the problem of detecting audio adversarial examples by proposing a novel approach inspired by Multiversion Programming, which uses multiple off-the-shelf ASR systems to identify adversarial inputs, achieving detection accuracies over 98.6%.

Adversarial examples (AEs) are crafted by adding human-imperceptible perturbations to inputs such that a machine-learning based classifier incorrectly labels them. They have become a severe threat to the trustworthiness of machine learning. While AEs in the image domain have been well studied, audio AEs are less investigated. Recently, multiple techniques are proposed to generate audio AEs, which makes countermeasures against them an urgent task. Our experiments show that, given an AE, the transcription results by different Automatic Speech Recognition (ASR) systems differ significantly, as they use different architectures, parameters, and training datasets. Inspired by Multiversion Programming, we propose a novel audio AE detection approach, which utilizes multiple off-the-shelf ASR systems to determine whether an audio input is an AE. The evaluation shows that the detection achieves accuracies over 98.6%.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes