LGCRFeb 27, 2021

Effective Universal Unrestricted Adversarial Attacks using a MOE Approach

arXiv:2103.00250v18 citations
Originality Incremental advance
AI Analysis

This work addresses security vulnerabilities in deep learning models for image classification, though it is incremental as it builds on existing adversarial attack methods.

The paper tackled the problem of generating universal unrestricted adversarial examples in a black-box scenario using a multi-objective nested evolutionary algorithm, resulting in sequences of image filters that achieve high attack success rates while maintaining low detection rates.

Recent studies have shown that Deep Leaning models are susceptible to adversarial examples, which are data, in general images, intentionally modified to fool a machine learning classifier. In this paper, we present a multi-objective nested evolutionary algorithm to generate universal unrestricted adversarial examples in a black-box scenario. The unrestricted attacks are performed through the application of well-known image filters that are available in several image processing libraries, modern cameras, and mobile applications. The multi-objective optimization takes into account not only the attack success rate but also the detection rate. Experimental results showed that this approach is able to create a sequence of filters capable of generating very effective and undetectable attacks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes