LGAIDCNov 25, 2020

MetaGater: Fast Learning of Conditional Channel Gated Networks via Federated Meta-Learning

arXiv:2011.12511v28 citations
AI Analysis

This work aims to improve the computational efficiency and adaptability of deep learning models for resource-constrained environments, which is a significant problem for edge computing and mobile AI applications.

This paper addresses the challenge of deploying large deep learning models on resource-limited devices by proposing MetaGater, a federated meta-learning approach. It jointly trains a backbone network and a channel gating module to dynamically select filters, enabling fast adaptation to new tasks with one-step gradient descent from meta-initializations.

While deep learning has achieved phenomenal successes in many AI applications, its enormous model size and intensive computation requirements pose a formidable challenge to the deployment in resource-limited nodes. There has recently been an increasing interest in computationally-efficient learning methods, e.g., quantization, pruning and channel gating. However, most existing techniques cannot adapt to different tasks quickly. In this work, we advocate a holistic approach to jointly train the backbone network and the channel gating which enables dynamical selection of a subset of filters for more efficient local computation given the data input. Particularly, we develop a federated meta-learning approach to jointly learn good meta-initializations for both backbone networks and gating modules, by making use of the model similarity across learning tasks on different nodes. In this way, the learnt meta-gating module effectively captures the important filters of a good meta-backbone network, based on which a task-specific conditional channel gated network can be quickly adapted, i.e., through one-step gradient descent, from the meta-initializations in a two-stage procedure using new samples of that task. The convergence of the proposed federated meta-learning algorithm is established under mild conditions. Experimental results corroborate the effectiveness of our method in comparison to related work.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes