LGCRMar 22, 2022

Semi-Targeted Model Poisoning Attack on Federated Learning via Backward Error Analysis

arXiv:2203.11633v224 citationsh-index: 23
Originality Incremental advance
AI Analysis

This addresses security vulnerabilities in federated learning systems, particularly for scenarios with limited adversary knowledge, but it is incremental as it builds on existing poisoning methods.

The paper tackles the problem of semi-targeted model poisoning attacks in federated learning, where the goal is to misclassify data from a source class without a predetermined target class, and proposes the Attacking Distance-aware Attack (ADA) to enhance poisoning by optimizing the target class in feature space, achieving a 1.8 times increase in attack performance in the most challenging case.

Model poisoning attacks on federated learning (FL) intrude in the entire system via compromising an edge model, resulting in malfunctioning of machine learning models. Such compromised models are tampered with to perform adversary-desired behaviors. In particular, we considered a semi-targeted situation where the source class is predetermined however the target class is not. The goal is to cause the global classifier to misclassify data of the source class. Though approaches such as label flipping have been adopted to inject poisoned parameters into FL, it has been shown that their performances are usually class-sensitive varying with different target classes applied. Typically, an attack can become less effective when shifting to a different target class. To overcome this challenge, we propose the Attacking Distance-aware Attack (ADA) to enhance a poisoning attack by finding the optimized target class in the feature space. Moreover, we studied a more challenging situation where an adversary had limited prior knowledge about a client's data. To tackle this problem, ADA deduces pair-wise distances between different classes in the latent feature space from shared model parameters based on the backward error analysis. We performed extensive empirical evaluations on ADA by varying the factor of attacking frequency in three different image classification tasks. As a result, ADA succeeded in increasing the attack performance by 1.8 times in the most challenging case with an attacking frequency of 0.01.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes