MLLGOct 23, 2019

Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales

arXiv:1910.10367v23 citations
Originality Incremental advance
AI Analysis

This work addresses overfitting in neural network controllers for robotics, offering a scalable approach to performance bounds, but it is incremental as it builds on existing PAC-Bayes and variational inference frameworks.

The paper tackles the problem of overfitting and unwarranted extrapolations in neural network controllers by proposing a method to simultaneously learn functions and estimate performance bounds that scale to high-dimensional, non-linear environments without explicit assumptions, validated on MuJoCo locomotion tasks with improved generalization.

Neural Network based controllers hold enormous potential to learn complex, high-dimensional functions. However, they are prone to overfitting and unwarranted extrapolations. PAC Bayes is a generalized framework which is more resistant to overfitting and that yields performance bounds that hold with arbitrarily high probability even on the unjustified extrapolations. However, optimizing to learn such a function and a bound is intractable for complex tasks. In this work, we propose a method to simultaneously learn such a function and estimate performance bounds that scale organically to high-dimensions, non-linear environments without making any explicit assumptions about the environment. We build our approach on a parallel that we draw between the formulations called ELBO and PAC Bayes when the risk metric is negative log likelihood. Through our experiments on multiple high dimensional MuJoCo locomotion tasks, we validate the correctness of our theory, show its ability to generalize better, and investigate the factors that are important for its learning. The code for all the experiments is available at https://bit.ly/2qv0JjA.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes