LGDSMLNov 12, 2019

On Robustness to Adversarial Examples and Polynomial Optimization

arXiv:1911.04681v134 citations
Originality Highly original
AI Analysis

This work addresses the theoretical gap in understanding robust learning algorithms for adversarial examples, which is crucial for improving the security and reliability of machine learning systems, particularly deep networks, though it is incremental in extending theoretical foundations to specific hypothesis classes.

The paper tackles the problem of designing computationally efficient algorithms with provable robustness to adversarial perturbations, focusing on linear classifiers and degree-2 polynomial threshold functions. It connects robustness to polynomial optimization, enabling efficient robust algorithms, precise characterization of robustness costs, and principled methods for certifying robustness and generating attacks, with empirical validation on real data.

We study the design of computationally efficient algorithms with provable guarantees, that are robust to adversarial (test time) perturbations. While there has been an proliferation of recent work on this topic due to its connections to test time robustness of deep networks, there is limited theoretical understanding of several basic questions like (i) when and how can one design provably robust learning algorithms? (ii) what is the price of achieving robustness to adversarial examples in a computationally efficient manner? The main contribution of this work is to exhibit a strong connection between achieving robustness to adversarial examples, and a rich class of polynomial optimization problems, thereby making progress on the above questions. In particular, we leverage this connection to (a) design computationally efficient robust algorithms with provable guarantees for a large class of hypothesis, namely linear classifiers and degree-2 polynomial threshold functions (PTFs), (b) give a precise characterization of the price of achieving robustness in a computationally efficient manner for these classes, (c) design efficient algorithms to certify robustness and generate adversarial attacks in a principled manner for 2-layer neural networks. We empirically demonstrate the effectiveness of these attacks on real data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes