CVCRLGRODec 5, 2018

SADA: Semantic Adversarial Diagnostic Attacks for Autonomous Applications

arXiv:1812.02132v330 citations
Originality Highly original
AI Analysis

This addresses the problem of DNN robustness for autonomous systems, offering a novel approach beyond pixel-level attacks, though it is incremental in extending adversarial methods to semantic perturbations.

The authors tackled the lack of robustness in deep neural networks for safety-critical applications like autonomous driving by introducing a framework for semantic adversarial attacks that generate environment perturbations to fool trained agents, achieving consistent failure cases across tasks such as object detection and self-driving.

One major factor impeding more widespread adoption of deep neural networks (DNNs) is their lack of robustness, which is essential for safety-critical applications such as autonomous driving. This has motivated much recent work on adversarial attacks for DNNs, which mostly focus on pixel-level perturbations void of semantic meaning. In contrast, we present a general framework for adversarial attacks on trained agents, which covers semantic perturbations to the environment of the agent performing the task as well as pixel-level attacks. To do this, we re-frame the adversarial attack problem as learning a distribution of parameters that always fools the agent. In the semantic case, our proposed adversary (denoted as BBGAN) is trained to sample parameters that describe the environment with which the black-box agent interacts, such that the agent performs its dedicated task poorly in this environment. We apply BBGAN on three different tasks, primarily targeting aspects of autonomous navigation: object detection, self-driving, and autonomous UAV racing. On these tasks, BBGAN can generate failure cases that consistently fool a trained agent.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes