CVJun 10, 2020

Scalable Backdoor Detection in Neural Networks

arXiv:2006.05646v125 citations
Originality Highly original
AI Analysis

This addresses a critical security vulnerability in neural networks for AI practitioners, offering a more efficient and effective detection solution.

The paper tackles the problem of detecting Trojan attacks in deep learning models by proposing a scalable backdoor detection method that achieves perfect separation of Trojaned models from pure ones, improving over the state-of-the-art.

Recently, it has been shown that deep learning models are vulnerable to Trojan attacks, where an attacker can install a backdoor during training time to make the resultant model misidentify samples contaminated with a small trigger patch. Current backdoor detection methods fail to achieve good detection performance and are computationally expensive. In this paper, we propose a novel trigger reverse-engineering based approach whose computational complexity does not scale with the number of labels, and is based on a measure that is both interpretable and universal across different network and patch types. In experiments, we observe that our method achieves a perfect score in separating Trojaned models from pure models, which is an improvement over the current state-of-the art method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes