LGMLMay 14, 2019

Deep Neural Architecture Search with Deep Graph Bayesian Optimization

arXiv:1905.06159v155 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of efficiently searching for optimal deep neural architectures, representing an incremental improvement over existing Bayesian optimization methods.

The paper tackles the problem of neural architecture search by proposing a Bayesian graph neural network as a surrogate to automatically extract features from deep neural architectures, which significantly outperforms comparative methods on benchmark tasks.

Bayesian optimization (BO) is an effective method of finding the global optima of black-box functions. Recently BO has been applied to neural architecture search and shows better performance than pure evolutionary strategies. All these methods adopt Gaussian processes (GPs) as surrogate function, with the handcraft similarity metrics as input. In this work, we propose a Bayesian graph neural network as a new surrogate, which can automatically extract features from deep neural architectures, and use such learned features to fit and characterize black-box objectives and their uncertainty. Based on the new surrogate, we then develop a graph Bayesian optimization framework to address the challenging task of deep neural architecture search. Experiment results show our method significantly outperforms the comparative methods on benchmark tasks.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes