CVLGAug 26, 2019

A Statistical Defense Approach for Detecting Adversarial Examples

arXiv:1908.09705v12 citations
Originality Incremental advance
AI Analysis

This addresses the security issue of adversarial attacks for deep neural network applications, but it is incremental as it builds on existing defensive strategies.

The paper tackles the problem of detecting adversarial examples that fool deep neural networks by developing a detector that uses statistical information from the training set to build signatures from distorted replicas of test inputs. The method reliably detects malicious inputs and outperforms state-of-the-art approaches in various settings.

Adversarial examples are maliciously modified inputs created to fool deep neural networks (DNN). The discovery of such inputs presents a major issue to the expansion of DNN-based solutions. Many researchers have already contributed to the topic, providing both cutting edge-attack techniques and various defensive strategies. In this work, we focus on the development of a system capable of detecting adversarial samples by exploiting statistical information from the training-set. Our detector computes several distorted replicas of the test input, then collects the classifier's prediction vectors to build a meaningful signature for the detection task. Then, the signature is projected onto the class-specific statistic vector to infer the input's nature. The classification output of the original input is used to select the class-statistic vector. We show that our method reliably detects malicious inputs, outperforming state-of-the-art approaches in various settings, while being complementary to other defensive solutions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes