LGAIJun 8, 2024

Enhancing Adversarial Transferability via Information Bottleneck Constraints

arXiv:2406.05531v11 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of making adversarial attacks more effective across different models, which is important for security testing, but it is incremental as it builds on existing methods with a novel theoretical approach.

The paper tackles the problem of improving black-box transferable adversarial attacks by proposing a framework based on information bottleneck theory, which enhances transferability by focusing on invariant features, achieving competitive results on the ImageNet dataset.

From the perspective of information bottleneck (IB) theory, we propose a novel framework for performing black-box transferable adversarial attacks named IBTA, which leverages advancements in invariant features. Intuitively, diminishing the reliance of adversarial perturbations on the original data, under equivalent attack performance constraints, encourages a greater reliance on invariant features that contributes most to classification, thereby enhancing the transferability of adversarial attacks. Building on this motivation, we redefine the optimization of transferable attacks using a novel theoretical framework that centers around IB. Specifically, to overcome the challenge of unoptimizable mutual information, we propose a simple and efficient mutual information lower bound (MILB) for approximating computation. Moreover, to quantitatively evaluate mutual information, we utilize the Mutual Information Neural Estimator (MINE) to perform a thorough analysis. Our experiments on the ImageNet dataset well demonstrate the efficiency and scalability of IBTA and derived MILB. Our code is available at https://github.com/Biqing-Qi/Enhancing-Adversarial-Transferability-via-Information-Bottleneck-Constraints.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes