LGPRSep 3, 2023

A Comparative Evaluation of FedAvg and Per-FedAvg Algorithms for Dirichlet Distributed Heterogeneous Data

arXiv:2309.01275v134 citations
Originality Synthesis-oriented
AI Analysis

This work addresses data heterogeneity in federated learning, which is an incremental improvement for decentralized machine learning applications.

The paper compared Federated Averaging (FedAvg) and Personalized Federated Averaging (Per-FedAvg) on Non-IID data modeled with a Dirichlet distribution, finding that Per-FedAvg performed better under high heterogeneity.

In this paper, we investigate Federated Learning (FL), a paradigm of machine learning that allows for decentralized model training on devices without sharing raw data, there by preserving data privacy. In particular, we compare two strategies within this paradigm: Federated Averaging (FedAvg) and Personalized Federated Averaging (Per-FedAvg), focusing on their performance with Non-Identically and Independently Distributed (Non-IID) data. Our analysis shows that the level of data heterogeneity, modeled using a Dirichlet distribution, significantly affects the performance of both strategies, with Per-FedAvg showing superior robustness in conditions of high heterogeneity. Our results provide insights into the development of more effective and efficient machine learning strategies in a decentralized setting.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes