CVJun 26, 2019

Universal Litmus Patterns: Revealing Backdoor Attacks in CNNs

arXiv:1906.10842v2263 citations
Originality Highly original
AI Analysis

This addresses security vulnerabilities in deep learning models for applications like traffic sign recognition and image classification, representing a novel method for a known bottleneck.

The paper tackles the problem of detecting backdoor attacks in convolutional neural networks by introducing Universal Litmus Patterns (ULPs), which enable fast detection with only a few forward passes, achieving effectiveness across thousands of networks on four benchmark datasets.

The unprecedented success of deep neural networks in many applications has made these networks a prime target for adversarial exploitation. In this paper, we introduce a benchmark technique for detecting backdoor attacks (aka Trojan attacks) on deep convolutional neural networks (CNNs). We introduce the concept of Universal Litmus Patterns (ULPs), which enable one to reveal backdoor attacks by feeding these universal patterns to the network and analyzing the output (i.e., classifying the network as `clean' or `corrupted'). This detection is fast because it requires only a few forward passes through a CNN. We demonstrate the effectiveness of ULPs for detecting backdoor attacks on thousands of networks with different architectures trained on four benchmark datasets, namely the German Traffic Sign Recognition Benchmark (GTSRB), MNIST, CIFAR10, and Tiny-ImageNet. The codes and train/test models for this paper can be found here https://umbcvision.github.io/Universal-Litmus-Patterns/.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes