LGAICRMLOct 28, 2022

Universal Adversarial Directions

arXiv:2210.15997v21 citationsh-index: 15
Originality Incremental advance
AI Analysis

This work addresses the transferability issue in adversarial attacks for deep neural networks, which is an incremental improvement in the field of AI security.

The paper tackled the problem of universal adversarial perturbations (UAPs) struggling to transfer across deep neural network architectures by proposing Universal Adversarial Directions (UADs), which fix a universal direction for perturbations and allow free magnitude choice, resulting in superior transferability over standard UAPs as shown in evaluations on multiple benchmark image datasets.

Despite their great success in image recognition tasks, deep neural networks (DNNs) have been observed to be susceptible to universal adversarial perturbations (UAPs) which perturb all input samples with a single perturbation vector. However, UAPs often struggle in transferring across DNN architectures and lead to challenging optimization problems. In this work, we study the transferability of UAPs by analyzing equilibrium in the universal adversarial example game between the classifier and UAP adversary players. We show that under mild assumptions the universal adversarial example game lacks a pure Nash equilibrium, indicating UAPs' suboptimal transferability across DNN classifiers. To address this issue, we propose Universal Adversarial Directions (UADs) which only fix a universal direction for adversarial perturbations and allow the perturbations' magnitude to be chosen freely across samples. We prove that the UAD adversarial example game can possess a Nash equilibrium with a pure UAD strategy, implying the potential transferability of UADs. We also connect the UAD optimization problem to the well-known principal component analysis (PCA) and develop an efficient PCA-based algorithm for optimizing UADs. We evaluate UADs over multiple benchmark image datasets. Our numerical results show the superior transferability of UADs over standard gradient-based UAPs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes