LGAICRMLNov 13, 2019

Adversarial Examples in Modern Machine Learning: A Review

arXiv:1911.05268v2118 citations
Originality Synthesis-oriented
AI Analysis

This is a survey paper that synthesizes existing research on adversarial examples, which is an incremental contribution aimed at researchers in machine learning security.

The paper reviews the vulnerability of machine learning models to adversarial examples, focusing on visual domains, and provides an extensive overview of attack methods, defenses, and transferability properties.

Recent research has found that many families of machine learning models are vulnerable to adversarial examples: inputs that are specifically designed to cause the target model to produce erroneous outputs. In this survey, we focus on machine learning models in the visual domain, where methods for generating and detecting such examples have been most extensively studied. We explore a variety of adversarial attack methods that apply to image-space content, real world adversarial attacks, adversarial defenses, and the transferability property of adversarial examples. We also discuss strengths and weaknesses of various methods of adversarial attack and defense. Our aim is to provide an extensive coverage of the field, furnishing the reader with an intuitive understanding of the mechanics of adversarial attack and defense mechanisms and enlarging the community of researchers studying this fundamental set of problems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes