LGJan 26, 2018

Recasting Gradient-Based Meta-Learning as Hierarchical Bayes

arXiv:1801.08930v1550 citations
Originality Synthesis-oriented
AI Analysis

This work offers a theoretical reinterpretation of an existing meta-learning method, which is incremental but clarifies its Bayesian foundations for researchers in machine learning.

The paper reformulates the MAML algorithm as hierarchical Bayesian inference, providing a theoretical framework to understand its operation and enabling computational improvements through approximate inference techniques.

Meta-learning allows an intelligent agent to leverage prior learning episodes as a basis for quickly improving performance on a novel task. Bayesian hierarchical modeling provides a theoretical framework for formalizing meta-learning as inference for a set of parameters that are shared across tasks. Here, we reformulate the model-agnostic meta-learning algorithm (MAML) of Finn et al. (2017) as a method for probabilistic inference in a hierarchical Bayesian model. In contrast to prior methods for meta-learning via hierarchical Bayes, MAML is naturally applicable to complex function approximators through its use of a scalable gradient descent procedure for posterior inference. Furthermore, the identification of MAML as hierarchical Bayes provides a way to understand the algorithm's operation as a meta-learning procedure, as well as an opportunity to make use of computational strategies for efficient inference. We use this opportunity to propose an improvement to the MAML algorithm that makes use of techniques from approximate inference and curvature estimation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes