LGCROct 22, 2022

Hindering Adversarial Attacks with Implicit Neural Representations

arXiv:2210.13982v15 citationsh-index: 50Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of adversarial vulnerability in machine learning models for security-critical applications, though it is incremental as it builds on existing input transformation methods.

The paper tackles adversarial attacks on image classifiers by introducing the LINAC defence, which uses implicit neural representations to encode images, achieving robustness to perturbations up to ε=8/255 in L∞ and ε=0.5 in L2 norms on CIFAR-10 without adversarial training or significant performance drops.

We introduce the Lossy Implicit Network Activation Coding (LINAC) defence, an input transformation which successfully hinders several common adversarial attacks on CIFAR-$10$ classifiers for perturbations up to $ε= 8/255$ in $L_\infty$ norm and $ε= 0.5$ in $L_2$ norm. Implicit neural representations are used to approximately encode pixel colour intensities in $2\text{D}$ images such that classifiers trained on transformed data appear to have robustness to small perturbations without adversarial training or large drops in performance. The seed of the random number generator used to initialise and train the implicit neural representation turns out to be necessary information for stronger generic attacks, suggesting its role as a private key. We devise a Parametric Bypass Approximation (PBA) attack strategy for key-based defences, which successfully invalidates an existing method in this category. Interestingly, our LINAC defence also hinders some transfer and adaptive attacks, including our novel PBA strategy. Our results emphasise the importance of a broad range of customised attacks despite apparent robustness according to standard evaluations. LINAC source code and parameters of defended classifier evaluated throughout this submission are available: https://github.com/deepmind/linac

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes