LGAIMLMay 20, 2018

Model Aggregation via Good-Enough Model Spaces

arXiv:1805.07782v34 citations
Originality Highly original
AI Analysis

This addresses communication and privacy constraints in distributed learning, offering a novel method for efficient model aggregation.

The paper tackles the problem of learning a global model from data partitioned across multiple nodes without sharing data, by introducing the Good-Enough Model Spaces (GEMS) framework that intersects sets of 'good-enough' models, resulting in an average improvement of up to 15 points in accuracy over baseline aggregation techniques.

In many applications, the training data for a machine learning task is partitioned across multiple nodes, and aggregating this data may be infeasible due to communication, privacy, or storage constraints. Existing distributed optimization methods for learning global models in these settings typically aggregate local updates from each node in an iterative fashion. However, these approaches require many rounds of communication between nodes, and assume that updates can be synchronously shared across a connected network. In this work, we present Good-Enough Model Spaces (GEMS), a novel framework for learning a global model by carefully intersecting the sets of "good-enough" models across each node. Our approach utilizes minimal communication and does not require sharing of data between nodes. We present methods for learning both convex models and neural networks within this framework and discuss how small samples of held-out data can be used for post-learning fine-tuning. In experiments on image and medical datasets, our approach on average improves upon other baseline aggregation techniques such as ensembling or model averaging by as much as 15 points (accuracy).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes