CVJun 26, 2019

Universal Litmus Patterns: Revealing Backdoor Attacks in CNNs

Soheil Kolouri, Aniruddha Saha, Hamed Pirsiavash, Heiko Hoffmann

arXiv:1906.10842v228.2263 citationsHas Code

Originality Highly original

AI Analysis

This addresses security vulnerabilities in deep learning models for applications like traffic sign recognition and image classification, representing a novel method for a known bottleneck.

The paper tackles the problem of detecting backdoor attacks in convolutional neural networks by introducing Universal Litmus Patterns (ULPs), which enable fast detection with only a few forward passes, achieving effectiveness across thousands of networks on four benchmark datasets.

The unprecedented success of deep neural networks in many applications has made these networks a prime target for adversarial exploitation. In this paper, we introduce a benchmark technique for detecting backdoor attacks (aka Trojan attacks) on deep convolutional neural networks (CNNs). We introduce the concept of Universal Litmus Patterns (ULPs), which enable one to reveal backdoor attacks by feeding these universal patterns to the network and analyzing the output (i.e., classifying the network as `clean' or `corrupted'). This detection is fast because it requires only a few forward passes through a CNN. We demonstrate the effectiveness of ULPs for detecting backdoor attacks on thousands of networks with different architectures trained on four benchmark datasets, namely the German Traffic Sign Recognition Benchmark (GTSRB), MNIST, CIFAR10, and Tiny-ImageNet. The codes and train/test models for this paper can be found here https://umbcvision.github.io/Universal-Litmus-Patterns/.

View on arXiv PDF Code

Similar