LGAIJan 20, 2022

Federated Learning with Heterogeneous Architectures using Graph HyperNetworks

arXiv:2201.08459v132 citations
AI Analysis

This addresses the problem of enabling cross-platform or inter-organizational FL while preserving data privacy and architectural proprietary, representing a novel method for a known bottleneck.

The paper tackles the limitation of standard Federated Learning (FL) to identical client architectures by proposing a framework using graph hypernetworks for parameter sharing across heterogeneous models, achieving notably better performance on benchmarks and showing generalization to unseen architectures.

Standard Federated Learning (FL) techniques are limited to clients with identical network architectures. This restricts potential use-cases like cross-platform training or inter-organizational collaboration when both data privacy and architectural proprietary are required. We propose a new FL framework that accommodates heterogeneous client architecture by adopting a graph hypernetwork for parameter sharing. A property of the graph hyper network is that it can adapt to various computational graphs, thereby allowing meaningful parameter sharing across models. Unlike existing solutions, our framework does not limit the clients to share the same architecture type, makes no use of external data and does not require clients to disclose their model architecture. Compared with distillation-based and non-graph hypernetwork baselines, our method performs notably better on standard benchmarks. We additionally show encouraging generalization performance to unseen architectures.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes