LGCVJun 12, 2021

Adversarial Robustness via Fisher-Rao Regularization

arXiv:2106.06685v329 citations
Originality Incremental advance
AI Analysis

This addresses the problem of adversarial brittleness in machine learning models, offering an incremental improvement with specific gains in efficiency and performance.

The paper tackles adversarial robustness in neural networks by proposing FIRE, a Fisher-Rao regularization method based on information geometry, which improves both clean and robust accuracy by up to 1% and reduces training time by 20% compared to state-of-the-art methods.

Adversarial robustness has become a topic of growing interest in machine learning since it was observed that neural networks tend to be brittle. We propose an information-geometric formulation of adversarial defense and introduce FIRE, a new Fisher-Rao regularization for the categorical cross-entropy loss, which is based on the geodesic distance between the softmax outputs corresponding to natural and perturbed input features. Based on the information-geometric properties of the class of softmax distributions, we derive an explicit characterization of the Fisher-Rao Distance (FRD) for the binary and multiclass cases, and draw some interesting properties as well as connections with standard regularization metrics. Furthermore, for a simple linear and Gaussian model, we show that all Pareto-optimal points in the accuracy-robustness region can be reached by FIRE while other state-of-the-art methods fail. Empirically, we evaluate the performance of various classifiers trained with the proposed loss on standard datasets, showing up to a simultaneous 1\% of improvement in terms of clean and robust performances while reducing the training time by 20\% over the best-performing methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes