LGJul 26, 2023

Characterizing Data Point Vulnerability via Average-Case Robustness

Harvard
arXiv:2307.13885v63 citationsh-index: 43
Originality Incremental advance
AI Analysis

This work addresses the need for more nuanced robustness assessment in ML, offering tools to improve model behavior characterization, though it is incremental as it builds on existing robustness frameworks.

The paper tackles the problem of characterizing data point vulnerability in machine learning models by introducing average-case robustness as a complementary framework to adversarial robustness, and proposes the first analytical estimators for this measure, showing they are accurate and efficient for deep learning models in identifying vulnerable points and quantifying robustness bias.

Studying the robustness of machine learning models is important to ensure consistent model behaviour across real-world settings. To this end, adversarial robustness is a standard framework, which views robustness of predictions through a binary lens: either a worst-case adversarial misclassification exists in the local region around an input, or it does not. However, this binary perspective does not account for the degrees of vulnerability, as data points with a larger number of misclassified examples in their neighborhoods are more vulnerable. In this work, we consider a complementary framework for robustness, called average-case robustness, which measures the fraction of points in a local region that provides consistent predictions. However, computing this quantity is hard, as standard Monte Carlo approaches are inefficient especially for high-dimensional inputs. In this work, we propose the first analytical estimators for average-case robustness for multi-class classifiers. We show empirically that our estimators are accurate and efficient for standard deep learning models and demonstrate their usefulness for identifying vulnerable data points, as well as quantifying robustness bias of models. Overall, our tools provide a complementary view to robustness, improving our ability to characterize model behaviour.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes