LG AI NE MLDec 29, 2020

Meta Learning Backpropagation And Improving It

arXiv:2012.14905v418.467 citations

Originality Highly original

AI Analysis

This work addresses the problem of creating more flexible and generalizable learning algorithms for the machine learning community, potentially offering an alternative to traditional gradient descent.

This paper proposes Variable Shared Meta Learning (VSML), a framework that unifies existing meta-learning concepts by using weight-sharing and sparsity in neural networks. A VSML implementation with tiny LSTMs replaces network weights, allowing backpropagation to be executed in forward-mode and meta-learning new learning algorithms that generalize to new datasets without explicit gradient calculation.

Many concepts have been proposed for meta learning with neural networks (NNs), e.g., NNs that learn to reprogram fast weights, Hebbian plasticity, learned learning rules, and meta recurrent NNs. Our Variable Shared Meta Learning (VSML) unifies the above and demonstrates that simple weight-sharing and sparsity in an NN is sufficient to express powerful learning algorithms (LAs) in a reusable fashion. A simple implementation of VSML where the weights of a neural network are replaced by tiny LSTMs allows for implementing the backpropagation LA solely by running in forward-mode. It can even meta learn new LAs that differ from online backpropagation and generalize to datasets outside of the meta training distribution without explicit gradient calculation. Introspection reveals that our meta learned LAs learn through fast association in a way that is qualitatively different from gradient descent.

View on arXiv PDF

Similar