DCLGNov 1, 2021

Implicit Model Specialization through DAG-based Decentralized Federated Learning

arXiv:2111.01257v222 citations
Originality Highly original
AI Analysis

It addresses the challenge of data heterogeneity for distributed clients in federated learning, offering a novel method that combines personalization and poisoning robustness, though it appears incremental in its specialization focus.

The paper tackles the problem of non-IID data in federated learning by proposing a DAG-based decentralized approach that enables implicit model specialization, resulting in stable model accuracy and less variance across clients compared to federated averaging.

Federated learning allows a group of distributed clients to train a common machine learning model on private data. The exchange of model updates is managed either by a central entity or in a decentralized way, e.g. by a blockchain. However, the strong generalization across all clients makes these approaches unsuited for non-independent and identically distributed (non-IID) data. We propose a unified approach to decentralization and personalization in federated learning that is based on a directed acyclic graph (DAG) of model updates. Instead of training a single global model, clients specialize on their local data while using the model updates from other clients dependent on the similarity of their respective data. This specialization implicitly emerges from the DAG-based communication and selection of model updates. Thus, we enable the evolution of specialized models, which focus on a subset of the data and therefore cover non-IID data better than federated learning in a centralized or blockchain-based setup. To the best of our knowledge, the proposed solution is the first to unite personalization and poisoning robustness in fully decentralized federated learning. Our evaluation shows that the specialization of models emerges directly from the DAG-based communication of model updates on three different datasets. Furthermore, we show stable model accuracy and less variance across clients when compared to federated averaging.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes