MLLGJun 2, 2020

Meta Learning as Bayes Risk Minimization

arXiv:2006.01488v11 citations
Originality Incremental advance
AI Analysis

This work provides a theoretical improvement for meta-learning methods, particularly in probabilistic frameworks, but is incremental as it builds on existing approaches like Neural Process.

The authors tackled the problem of meta-learning by reframing it as Bayesian risk minimization, addressing issues in Neural Process with a novel Gaussian approximation for posterior distributions that converges correctly, and demonstrated competitive results on benchmark datasets.

Meta-Learning is a family of methods that use a set of interrelated tasks to learn a model that can quickly learn a new query task from a possibly small contextual dataset. In this study, we use a probabilistic framework to formalize what it means for two tasks to be related and reframe the meta-learning problem into the problem of Bayesian risk minimization (BRM). In our formulation, the BRM optimal solution is given by the predictive distribution computed from the posterior distribution of the task-specific latent variable conditioned on the contextual dataset, and this justifies the philosophy of Neural Process. However, the posterior distribution in Neural Process violates the way the posterior distribution changes with the contextual dataset. To address this problem, we present a novel Gaussian approximation for the posterior distribution that generalizes the posterior of the linear Gaussian model. Unlike that of the Neural Process, our approximation of the posterior distributions converges to the maximum likelihood estimate with the same rate as the true posterior distribution. We also demonstrate the competitiveness of our approach on benchmark datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes