MLLGJun 7, 2016

Towards a Neural Statistician

arXiv:1606.02185v2442 citations
AI Analysis

This work addresses the challenge of reusing knowledge across datasets for machine learning, which is incremental as it builds on existing variational autoencoder methods.

The paper tackles the problem of learning dataset-level representations by extending a variational autoencoder to compute unsupervised statistics of datasets, enabling efficient learning on new datasets for tasks like clustering, generative model transfer, and classification.

An efficient learner is one who reuses what they already know to tackle a new problem. For a machine learner, this means understanding the similarities amongst datasets. In order to do this, one must take seriously the idea of working with datasets, rather than datapoints, as the key objects to model. Towards this goal, we demonstrate an extension of a variational autoencoder that can learn a method for computing representations, or statistics, of datasets in an unsupervised fashion. The network is trained to produce statistics that encapsulate a generative model for each dataset. Hence the network enables efficient learning from new datasets for both unsupervised and supervised tasks. We show that we are able to learn statistics that can be used for: clustering datasets, transferring generative models to new datasets, selecting representative samples of datasets and classifying previously unseen classes. We refer to our model as a neural statistician, and by this we mean a neural network that can learn to compute summary statistics of datasets without supervision.

Code Implementations5 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes