LGCRCVOct 15, 2024

Adversarially Guided Stateful Defense Against Backdoor Attacks in Federated Deep Learning

arXiv:2410.11205v15 citationsh-index: 67Has CodeACSAC
Originality Incremental advance
AI Analysis

This addresses the problem of backdoor attacks in Federated Learning for practitioners, offering a robust defense with minimal data requirements, though it is incremental as it builds on existing cluster-based approaches.

The paper tackles the vulnerability of Federated Learning to backdoor attacks by proposing AGSD, a defense that uses adversarial perturbations on a small held-out dataset to compute a trust index for cluster selection and maintains client trust states, achieving minimal clean accuracy drops (5% worst-case) while outperforming state-of-the-art defenses in realistic settings.

Recent works have shown that Federated Learning (FL) is vulnerable to backdoor attacks. Existing defenses cluster submitted updates from clients and select the best cluster for aggregation. However, they often rely on unrealistic assumptions regarding client submissions and sampled clients population while choosing the best cluster. We show that in realistic FL settings, state-of-the-art (SOTA) defenses struggle to perform well against backdoor attacks in FL. To address this, we highlight that backdoored submissions are adversarially biased and overconfident compared to clean submissions. We, therefore, propose an Adversarially Guided Stateful Defense (AGSD) against backdoor attacks on Deep Neural Networks (DNNs) in FL scenarios. AGSD employs adversarial perturbations to a small held-out dataset to compute a novel metric, called the trust index, that guides the cluster selection without relying on any unrealistic assumptions regarding client submissions. Moreover, AGSD maintains a trust state history of each client that adaptively penalizes backdoored clients and rewards clean clients. In realistic FL settings, where SOTA defenses mostly fail to resist attacks, AGSD mostly outperforms all SOTA defenses with minimal drop in clean accuracy (5% in the worst-case compared to best accuracy) even when (a) given a very small held-out dataset -- typically AGSD assumes 50 samples (<= 0.1% of the training data) and (b) no heldout dataset is available, and out-of-distribution data is used instead. For reproducibility, our code will be openly available at: https://github.com/hassanalikhatim/AGSD.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes