LG | Dec 28, 2023

Replica Tree-based Federated Learning using Limited Data

arXiv:2312.17159v1 | 1 citation | h-index: 31 | Has Code | Neural Networks
Originality: Incremental advance
AI Analysis

The paper addresses a practical challenge in settings such as medical institutions, where both the amount of data and the number of clients are constrained. The contribution appears incremental because it builds on existing federated learning methods.

The paper tackles federated learning with limited data and a small number of clients. It proposes RepTreeFL, a framework that replicates each client to increase model diversity and combines the resulting models through tree-based aggregation. The reported experiments show gains on graph generation and image classification.

Learning from limited data has been extensively studied in machine learning, considering that deep neural networks achieve optimal performance when trained using a large number of samples. Although various strategies have been proposed for centralized training, the topic of federated learning with small datasets remains largely unexplored. Moreover, in realistic scenarios, such as settings where medical institutions are involved, the number of participating clients is also constrained. In this work, we propose a novel federated learning framework, named RepTreeFL. At the core of the solution is the concept of a replica, where we replicate each participating client by copying its model architecture and perturbing its local data distribution. Our approach enables learning from limited data and a small number of clients by aggregating a larger number of models with diverse data distributions. Furthermore, we leverage the hierarchical structure of the client network (both original and virtual), alongside the model diversity across replicas, and introduce a diversity-based tree aggregation, where replicas are combined in a tree-like manner and the aggregation weights are dynamically updated based on the model discrepancy. We evaluated our method on two tasks and two types of data: graph generation and image classification (binary and multi-class), with both homogeneous and heterogeneous model architectures. Experimental results demonstrate the effectiveness and superior performance of RepTreeFL in settings where both data and clients are limited. Our code is available at https://github.com/basiralab/RepTreeFL.
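
The replica and tree-aggregation ideas described in the abstract can be illustrated with a small sketch. This is not the authors' implementation (see the linked repository for that); it is a toy version under stated assumptions. Models are flat lists of floats, the data perturbation is plain random subsampling, discrepancy is Euclidean distance to the original client model, and replicas that drift further from that model receive smaller weights. The abstract does not say which perturbation is used or in which direction discrepancy moves the weights, so those choices, along with every function name below, are hypothetical.

```python
import math
import random


def make_replica_datasets(samples, n_replicas, keep_fraction=0.8, seed=0):
    """Build one perturbed dataset per replica.

    Assumption: the perturbation is random subsampling without replacement.
    """
    rng = random.Random(seed)
    k = max(1, int(len(samples) * keep_fraction))
    return [rng.sample(samples, k) for _ in range(n_replicas)]


def discrepancy(a, b):
    """Euclidean distance between two parameter vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def diversity_weights(anchor, models):
    """Normalized weights that shrink as a model moves away from the anchor.

    Assumption: weight is proportional to exp(-discrepancy).
    """
    scores = [math.exp(-discrepancy(anchor, m)) for m in models]
    total = sum(scores)
    return [s / total for s in scores]


def weighted_average(models, weights):
    """Coordinate-wise weighted average of parameter vectors."""
    dim = len(models[0])
    return [sum(w * m[i] for w, m in zip(weights, models)) for i in range(dim)]


def aggregate_client(anchor, replicas):
    """Lower tree level: merge a client's own model with its replicas."""
    models = [anchor] + replicas
    return weighted_average(models, diversity_weights(anchor, models))


def aggregate_tree(clients):
    """Upper tree level: uniform average of the per-client merged models.

    `clients` is a list of (anchor_params, [replica_params, ...]) pairs.
    """
    merged = [aggregate_client(anchor, replicas) for anchor, replicas in clients]
    uniform = [1.0 / len(merged)] * len(merged)
    return weighted_average(merged, uniform)


# Two clients with two-parameter models; each client has its own replicas.
clients = [
    ([1.0, 0.0], [[1.2, 0.1], [0.9, -0.1]]),
    ([3.0, 2.0], [[2.8, 2.1]]),
]
global_model = aggregate_tree(clients)
```

In a full training loop, each replica would be trained on its perturbed dataset before every aggregation round, and the weights would be recomputed each round as the models change.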

Code Implementations: 1 repo
