MLAILGNov 1, 2021

End-to-End Learning of Deep Kernel Acquisition Functions for Bayesian Optimization

arXiv:2111.00639v15 citations
Originality Highly original
AI Analysis

This addresses the challenge of optimizing complex functions in high-dimensional spaces for applications like text analysis, though it is incremental as it builds on prior neural kernel methods.

The paper tackles the problem of Bayesian optimization (BO) on high-dimensional data by proposing a meta-learning method that trains neural network-based kernels to directly improve BO performance, achieving better results than existing methods on three text document datasets.

For Bayesian optimization (BO) on high-dimensional data with complex structure, neural network-based kernels for Gaussian processes (GPs) have been used to learn flexible surrogate functions by the high representation power of deep learning. However, existing methods train neural networks by maximizing the marginal likelihood, which do not directly improve the BO performance. In this paper, we propose a meta-learning method for BO with neural network-based kernels that minimizes the expected gap between the true optimum value and the best value found by BO. We model a policy, which takes the current evaluated data points as input and outputs the next data point to be evaluated, by a neural network, where neural network-based kernels, GPs, and mutual information-based acquisition functions are used as its layers. With our model, the neural network-based kernel is trained to be appropriate for the acquisition function by backpropagating the gap through the acquisition function and GP. Our model is trained by a reinforcement learning framework from multiple tasks. Since the neural network is shared across different tasks, we can gather knowledge on BO from multiple training tasks, and use the knowledge for unseen test tasks. In experiments using three text document datasets, we demonstrate that the proposed method achieves better BO performance than the existing methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes