LGMLFeb 24, 2020

A Model-Based Derivative-Free Approach to Black-Box Adversarial Examples: BOBYQA

arXiv:2002.10349v1
AI Analysis

This work addresses the challenge of efficiently finding adversarial examples for deep networks in security-critical applications, but it is incremental as it adapts an existing optimization method to this domain.

The paper tackles the problem of generating adversarial examples in black-box settings by using a model-based derivative-free optimization algorithm (BOBYQA), which reduces the number of network queries needed compared to model-free methods, especially when perturbation energy is low or networks are adversarially trained.

We demonstrate that model-based derivative free optimisation algorithms can generate adversarial targeted misclassification of deep networks using fewer network queries than non-model-based methods. Specifically, we consider the black-box setting, and show that the number of networks queries is less impacted by making the task more challenging either through reducing the allowed $\ell^{\infty}$ perturbation energy or training the network with defences against adversarial misclassification. We illustrate this by contrasting the BOBYQA algorithm with the state-of-the-art model-free adversarial targeted misclassification approaches based on genetic, combinatorial, and direct-search algorithms. We observe that for high $\ell^{\infty}$ energy perturbations on networks, the aforementioned simpler model-free methods require the fewest queries. In contrast, the proposed BOBYQA based method achieves state-of-the-art results when the perturbation energy decreases, or if the network is trained against adversarial perturbations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes