MLLGJun 30, 2020

Black-box Certification and Learning under Adversarial Perturbations

arXiv:2006.16520v221 citations
AI Analysis

This work addresses adversarial robustness for machine learning systems, providing theoretical insights that are incremental to existing frameworks.

The paper tackles the problem of classification under adversarial perturbations from both learning and certification perspectives, establishing possibility and impossibility results for proper learning of VC-classes and introducing a black-box certification setting with query budget constraints.

We formally study the problem of classification under adversarial perturbations from a learner's perspective as well as a third-party who aims at certifying the robustness of a given black-box classifier. We analyze a PAC-type framework of semi-supervised learning and identify possibility and impossibility results for proper learning of VC-classes in this setting. We further introduce a new setting of black-box certification under limited query budget, and analyze this for various classes of predictors and perturbation. We also consider the viewpoint of a black-box adversary that aims at finding adversarial examples, showing that the existence of an adversary with polynomial query complexity can imply the existence of a sample efficient robust learner.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes