LGCVMLOct 13, 2022

COLLIDER: A Robust Training Framework for Backdoor Data

arXiv:2210.06704v110.48 citationsh-index: 56Has Code
Originality Incremental advance
AI Analysis

This addresses the vulnerability of DNNs to backdoor attacks, offering a novel training approach for improved security, though it is incremental as it builds on existing detection methods.

The paper tackles the problem of backdoor attacks in deep neural networks by introducing COLLIDER, a robust training framework that filters poisoned data using geometric coreset selection, reducing the backdoor success rate significantly on various datasets.

Deep neural network (DNN) classifiers are vulnerable to backdoor attacks. An adversary poisons some of the training data in such attacks by installing a trigger. The goal is to make the trained DNN output the attacker's desired class whenever the trigger is activated while performing as usual for clean data. Various approaches have recently been proposed to detect malicious backdoored DNNs. However, a robust, end-to-end training approach, like adversarial training, is yet to be discovered for backdoor poisoned data. In this paper, we take the first step toward such methods by developing a robust training framework, COLLIDER, that selects the most prominent samples by exploiting the underlying geometric structures of the data. Specifically, we effectively filter out candidate poisoned data at each training epoch by solving a geometrical coreset selection objective. We first argue how clean data samples exhibit (1) gradients similar to the clean majority of data and (2) low local intrinsic dimensionality (LID). Based on these criteria, we define a novel coreset selection objective to find such samples, which are used for training a DNN. We show the effectiveness of the proposed method for robust training of DNNs on various poisoned datasets, reducing the backdoor success rate significantly.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes