LGMLMay 13, 2019

Tabular Benchmarks for Joint Architecture and Hyperparameter Optimization

arXiv:1905.04970v1104 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of cumbersome empirical evaluation for HPO researchers by offering efficient, reproducible benchmarks, though it is incremental as it builds on existing HPO methods with new data.

The paper tackles the challenge of evaluating hyperparameter optimization (HPO) methods by providing cheap, realistic benchmarks based on feed-forward neural networks across four regression datasets, enabling an in-depth analysis of optimization properties and hyperparameter importance, and exhaustively comparing state-of-the-art HPO methods for performance and robustness.

Due to the high computational demands executing a rigorous comparison between hyperparameter optimization (HPO) methods is often cumbersome. The goal of this paper is to facilitate a better empirical evaluation of HPO methods by providing benchmarks that are cheap to evaluate, but still represent realistic use cases. We believe these benchmarks provide an easy and efficient way to conduct reproducible experiments for neural hyperparameter search. Our benchmarks consist of a large grid of configurations of a feed forward neural network on four different regression datasets including architectural hyperparameters and hyperparameters concerning the training pipeline. Based on this data, we performed an in-depth analysis to gain a better understanding of the properties of the optimization problem, as well as of the importance of different types of hyperparameters. Second, we exhaustively compared various different state-of-the-art methods from the hyperparameter optimization literature on these benchmarks in terms of performance and robustness.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes