LGCRCVMLApr 26, 2022

On Fragile Features and Batch Normalization in Adversarial Training

arXiv:2204.12393v15 citationsh-index: 137
Originality Synthesis-oriented
AI Analysis

This addresses the problem of understanding BN's impact on adversarial robustness for deep learning practitioners, but it is incremental as it builds on prior work about BN and fragile features.

The study investigated the role of batch normalization (BN) in adversarial training, finding that fine-tuning BN layers on CIFAR10 can achieve non-trivial adversarial robustness, while training them from scratch does not.

Modern deep learning architecture utilize batch normalization (BN) to stabilize training and improve accuracy. It has been shown that the BN layers alone are surprisingly expressive. In the context of robustness against adversarial examples, however, BN is argued to increase vulnerability. That is, BN helps to learn fragile features. Nevertheless, BN is still used in adversarial training, which is the de-facto standard to learn robust features. In order to shed light on the role of BN in adversarial training, we investigate to what extent the expressiveness of BN can be used to robustify fragile features in comparison to random features. On CIFAR10, we find that adversarially fine-tuning just the BN layers can result in non-trivial adversarial robustness. Adversarially training only the BN layers from scratch, in contrast, is not able to convey meaningful adversarial robustness. Our results indicate that fragile features can be used to learn models with moderate adversarial robustness, while random features cannot

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes