LG CR MLAug 21, 2019

Denoising and Verification Cross-Layer Ensemble Against Black-box Adversarial Attacks

Ka-Ho Chow, Wenqi Wei, Yanzhao Wu, Ling Liu

arXiv:1908.07667v27.118 citations

Originality Incremental advance

AI Analysis

This work addresses the security of DNNs against black-box adversarial attacks, offering an incremental improvement in defense mechanisms for machine learning systems.

The paper tackles the vulnerability of deep neural networks to adversarial attacks by proposing MODEF, a cross-layer ensemble framework that combines denoising and verification to enhance robustness, achieving high defense success rates against eleven attacks on benchmark datasets.

Deep neural networks (DNNs) have demonstrated impressive performance on many challenging machine learning tasks. However, DNNs are vulnerable to adversarial inputs generated by adding maliciously crafted perturbations to the benign inputs. As a growing number of attacks have been reported to generate adversarial inputs of varying sophistication, the defense-attack arms race has been accelerated. In this paper, we present MODEF, a cross-layer model diversity ensemble framework. MODEF intelligently combines unsupervised model denoising ensemble with supervised model verification ensemble by quantifying model diversity, aiming to boost the robustness of the target model against adversarial examples. Evaluated using eleven representative attacks on popular benchmark datasets, we show that MODEF achieves remarkable defense success rates, compared with existing defense methods, and provides a superior capability of repairing adversarial inputs and making correct predictions with high accuracy in the presence of black-box attacks.

View on arXiv PDF

Similar