LG AI CR DCMay 20, 2025

FedGraM: Defending Against Untargeted Attacks in Federated Learning via Embedding Gram Matrix

arXiv:2505.14024v14.1h-index: 2

Originality Incremental advance

AI Analysis

This addresses security vulnerabilities in federated learning for privacy-sensitive applications, offering a novel detection-based defense against untargeted attacks, though it appears incremental as it builds on existing generalization concepts.

The paper tackles the problem of untargeted attacks degrading global model performance in federated learning by proposing FedGraM, a robust aggregation method that detects and removes malicious models using embedding Gram matrix norms, achieving exceptional performance and outperforming state-of-the-art defenses with limited auxiliary data.

Federated Learning (FL) enables geographically distributed clients to collaboratively train machine learning models by sharing only their local models, ensuring data privacy. However, FL is vulnerable to untargeted attacks that aim to degrade the global model's performance on the underlying data distribution. Existing defense mechanisms attempt to improve FL's resilience against such attacks, but their effectiveness is limited in practical FL environments due to data heterogeneity. On the contrary, we aim to detect and remove the attacks to mitigate their impact. Generalization contribution plays a crucial role in distinguishing untargeted attacks. Our observations indicate that, with limited data, the divergence between embeddings representing different classes provides a better measure of generalization than direct accuracy. In light of this, we propose a novel robust aggregation method, FedGraM, designed to defend against untargeted attacks in FL. The server maintains an auxiliary dataset containing one sample per class to support aggregation. This dataset is fed to the local models to extract embeddings. Then, the server calculates the norm of the Gram Matrix of the embeddings for each local model. The norm serves as an indicator of each model's inter-class separation capability in the embedding space. FedGraM identifies and removes potentially malicious models by filtering out those with the largest norms, then averages the remaining local models to form the global model. We conduct extensive experiments to evaluate the performance of FedGraM. Our empirical results show that with limited data samples used to construct the auxiliary dataset, FedGraM achieves exceptional performance, outperforming state-of-the-art defense methods.

View on arXiv PDF

Similar