LGMLOct 22, 2019

No-regret Non-convex Online Meta-Learning

arXiv:1910.10196v415 citations
Originality Incremental advance
AI Analysis

This work addresses continual lifelong learning for AI systems by enabling robust meta-learning in non-convex scenarios, though it is incremental as it extends existing convex frameworks.

The paper tackles the problem of online meta-learning in non-convex settings by generalizing the framework from convex to non-convex and introducing local regret as a performance measure, achieving a logarithmic local regret theoretically and demonstrating empirical superiority on a real-world task.

The online meta-learning framework is designed for the continual lifelong learning setting. It bridges two fields: meta-learning which tries to extract prior knowledge from past tasks for fast learning of future tasks, and online-learning which deals with the sequential setting where problems are revealed one by one. In this paper, we generalize the original framework from convex to non-convex setting, and introduce the local regret as the alternative performance measure. We then apply this framework to stochastic settings, and show theoretically that it enjoys a logarithmic local regret, and is robust to any hyperparameter initialization. The empirical test on a real-world task demonstrates its superiority compared with traditional methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes