CVAug 25, 2021

Adversarially Robust One-class Novelty Detection

Shao-Yuan Lo, Poojan Oza, Vishal M. Patel

arXiv:2108.11168v212.642 citationsHas Code

Originality Highly original

AI Analysis

This addresses the problem of adversarial robustness in one-class novelty detection for security-critical applications, representing an incremental improvement with a novel method for a known bottleneck.

The paper tackles the vulnerability of deep one-class novelty detectors to adversarial attacks, showing that existing methods are susceptible and common defenses are ineffective, and proposes a defense strategy called PrincipaLS that manipulates the latent space to improve robustness, achieving consistent enhancements across multiple attacks, datasets, and detectors.

One-class novelty detectors are trained with examples of a particular class and are tasked with identifying whether a query example belongs to the same known class. Most recent advances adopt a deep auto-encoder style architecture to compute novelty scores for detecting novel class data. Deep networks have shown to be vulnerable to adversarial attacks, yet little focus is devoted to studying the adversarial robustness of deep novelty detectors. In this paper, we first show that existing novelty detectors are susceptible to adversarial examples. We further demonstrate that commonly-used defense approaches for classification tasks have limited effectiveness in one-class novelty detection. Hence, we need a defense specifically designed for novelty detection. To this end, we propose a defense strategy that manipulates the latent space of novelty detectors to improve the robustness against adversarial examples. The proposed method, referred to as Principal Latent Space (PrincipaLS), learns the incrementally-trained cascade principal components in the latent space to robustify novelty detectors. PrincipaLS can purify latent space against adversarial examples and constrain latent space to exclusively model the known class distribution. We conduct extensive experiments on eight attacks, five datasets and seven novelty detectors, showing that PrincipaLS consistently enhances the adversarial robustness of novelty detection models. Code is available at https://github.com/shaoyuanlo/PrincipaLS

View on arXiv PDF Code

Similar