LGCRCVAug 1, 2022

Attacking Adversarial Defences by Smoothing the Loss Landscape

arXiv:2208.00862v25 citationsh-index: 77
Originality Incremental advance
AI Analysis

It addresses the challenge of bypassing adversarial defenses for researchers and practitioners in machine learning security, but is incremental as it builds on existing gradient-based and gradient-free attack methods.

This paper tackles the problem of adversarial defenses that create rugged loss landscapes to hinder attacks, by proposing a loss-smoothing method using the Weierstrass transform to improve gradient estimates and strengthen adversaries, demonstrating efficacy against various defenses.

This paper investigates a family of methods for defending against adversarial attacks that owe part of their success to creating a noisy, discontinuous, or otherwise rugged loss landscape that adversaries find difficult to navigate. A common, but not universal, way to achieve this effect is via the use of stochastic neural networks. We show that this is a form of gradient obfuscation, and propose a general extension to gradient-based adversaries based on the Weierstrass transform, which smooths the surface of the loss function and provides more reliable gradient estimates. We further show that the same principle can strengthen gradient-free adversaries. We demonstrate the efficacy of our loss-smoothing method against both stochastic and non-stochastic adversarial defences that exhibit robustness due to this type of obfuscation. Furthermore, we provide analysis of how it interacts with Expectation over Transformation; a popular gradient-sampling method currently used to attack stochastic defences.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes