LGCROCMLApr 3, 2019

HopSkipJumpAttack: A Query-Efficient Decision-Based Attack

arXiv:1904.02144v5812 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of query-efficient adversarial attacks for machine learning security, representing an incremental improvement over existing methods.

The paper tackles the problem of decision-based adversarial attacks on trained models by developing HopSkipJumpAttack, a family of algorithms that use binary information at the decision boundary to estimate gradient direction, resulting in significantly fewer model queries than Boundary Attack and competitive performance against defense mechanisms.

The goal of a decision-based adversarial attack on a trained model is to generate adversarial examples based solely on observing output labels returned by the targeted model. We develop HopSkipJumpAttack, a family of algorithms based on a novel estimate of the gradient direction using binary information at the decision boundary. The proposed family includes both untargeted and targeted attacks optimized for $\ell_2$ and $\ell_\infty$ similarity metrics respectively. Theoretical analysis is provided for the proposed algorithms and the gradient direction estimate. Experiments show HopSkipJumpAttack requires significantly fewer model queries than Boundary Attack. It also achieves competitive performance in attacking several widely-used defense mechanisms. (HopSkipJumpAttack was named Boundary Attack++ in a previous version of the preprint.)

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes