LG AI · Jan 20, 2022

Federated Learning with Heterogeneous Architectures using Graph HyperNetworks

Or Litany, Haggai Maron, David Acuna, Jan Kautz, Gal Chechik, Sanja Fidler

arXiv:2201.08459v1 · 14.132 citations

Originality: Highly original

AI Analysis

This work addresses the problem of enabling cross-platform or inter-organizational FL while preserving both data privacy and the confidentiality of proprietary architectures. It offers a novel method for a known bottleneck.

The paper tackles the restriction of standard Federated Learning (FL) to clients with identical architectures. It proposes a framework that uses a graph hypernetwork to share parameters across heterogeneous models, reports notably better performance than distillation-based and non-graph hypernetwork baselines on standard benchmarks, and shows encouraging generalization to unseen architectures.

Standard Federated Learning (FL) techniques are limited to clients with identical network architectures. This restricts potential use cases such as cross-platform training or inter-organizational collaboration, where both data privacy and the confidentiality of proprietary architectures are required. We propose a new FL framework that accommodates heterogeneous client architectures by adopting a graph hypernetwork for parameter sharing. A property of the graph hypernetwork is that it can adapt to various computational graphs, thereby allowing meaningful parameter sharing across models. Unlike existing solutions, our framework does not limit the clients to share the same architecture type, makes no use of external data, and does not require clients to disclose their model architecture. Compared with distillation-based and non-graph hypernetwork baselines, our method performs notably better on standard benchmarks. We additionally show encouraging generalization performance to unseen architectures.
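
The parameter-sharing idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes that each client describes its model as a chain of layer nodes, that a small shared hypernetwork maps node features to layer weights through one round of message passing, and that the server averages only the hypernetwork parameters. All names (`init_hypernet`, `generate_weights`, `chain_adjacency`, `fed_avg`), the sizes, and the random perturbation that stands in for local training are hypothetical.

```python
import numpy as np

HIDDEN = 8  # node-embedding size (assumed)
WIDTH = 4   # every generated layer is WIDTH x WIDTH (simplifying assumption)


def init_hypernet(rng):
    """Shared hypernetwork parameters; shapes do not depend on any client."""
    return {
        "embed": rng.normal(0, 0.1, (2, HIDDEN)),   # one row per layer type
        "msg": rng.normal(0, 0.1, (HIDDEN, HIDDEN)),  # message-passing weights
        "decode": rng.normal(0, 0.1, (HIDDEN, WIDTH * WIDTH)),
    }


def chain_adjacency(n_nodes):
    """Adjacency of a chain graph: node i+1 receives a message from node i."""
    adj = np.zeros((n_nodes, n_nodes))
    idx = np.arange(n_nodes - 1)
    adj[idx + 1, idx] = 1.0
    return adj


def generate_weights(hyper, layer_types, adjacency):
    """Map a computational graph to one weight matrix per node."""
    h = hyper["embed"][np.asarray(layer_types)]          # (n_nodes, HIDDEN)
    h = np.tanh(h + adjacency @ h @ hyper["msg"])        # one message-passing round
    flat = h @ hyper["decode"]                           # (n_nodes, WIDTH * WIDTH)
    return flat.reshape(len(layer_types), WIDTH, WIDTH)


def fed_avg(hypers):
    """Average hypernetwork parameters key by key, as in plain FedAvg."""
    return {k: np.mean([h[k] for h in hypers], axis=0) for k in hypers[0]}


rng = np.random.default_rng(0)
global_hyper = init_hypernet(rng)

# Two clients with different depths (2 and 5 layers); graphs stay local.
client_graphs = [[0, 1], [0, 1, 0, 1, 0]]
local_hypers = []
client_weights = []
for types in client_graphs:
    # Stand-in for local training: a small random perturbation, NOT real SGD.
    local = {k: v + rng.normal(0, 0.01, v.shape) for k, v in global_hyper.items()}
    client_weights.append(generate_weights(local, types, chain_adjacency(len(types))))
    local_hypers.append(local)

# The server only ever sees hypernetwork parameters, which share one shape.
global_hyper = fed_avg(local_hypers)
```

In this sketch the two clients obtain weight stacks of different depth from the same hypernetwork, while the server averages tensors of identical shape and never receives a client's graph. A real system would replace the perturbation with gradient-based local training of the hypernetwork and would handle layers of differing shapes.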
