LGDCAug 20, 2025

Federated Distillation on Edge Devices: Efficient Client-Side Filtering for Non-IID Data

arXiv:2508.14769v11 citationsh-index: 2Has Code2025 3rd International Conference on Federated Learning Technologies and Applications (FLTA)
Originality Incremental advance
AI Analysis

This work addresses efficiency and scalability issues in federated learning for edge devices, offering an incremental improvement over existing methods.

The paper tackles the challenge of inefficient client-side filtering and server-side latency in federated distillation for non-IID data by proposing EdgeFD, which uses a KMeans-based estimator to improve knowledge sharing and achieves accuracy close to IID scenarios with reduced computational overhead.

Federated distillation has emerged as a promising collaborative machine learning approach, offering enhanced privacy protection and reduced communication compared to traditional federated learning by exchanging model outputs (soft logits) rather than full model parameters. However, existing methods employ complex selective knowledge-sharing strategies that require clients to identify in-distribution proxy data through computationally expensive statistical density ratio estimators. Additionally, server-side filtering of ambiguous knowledge introduces latency to the process. To address these challenges, we propose a robust, resource-efficient EdgeFD method that reduces the complexity of the client-side density ratio estimation and removes the need for server-side filtering. EdgeFD introduces an efficient KMeans-based density ratio estimator for effectively filtering both in-distribution and out-of-distribution proxy data on clients, significantly improving the quality of knowledge sharing. We evaluate EdgeFD across diverse practical scenarios, including strong non-IID, weak non-IID, and IID data distributions on clients, without requiring a pre-trained teacher model on the server for knowledge distillation. Experimental results demonstrate that EdgeFD outperforms state-of-the-art methods, consistently achieving accuracy levels close to IID scenarios even under heterogeneous and challenging conditions. The significantly reduced computational overhead of the KMeans-based estimator is suitable for deployment on resource-constrained edge devices, thereby enhancing the scalability and real-world applicability of federated distillation. The code is available online for reproducibility.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes