LGMLOct 18, 2019

Fully Parallel Hyperparameter Search: Reshaped Space-Filling

arXiv:1910.08406v229 citations
Originality Highly original
AI Analysis

This addresses hyperparameter optimization for machine learning practitioners, offering a novel method that is not incremental but provides significant improvements over standard techniques.

The paper tackles the problem of fully parallel hyperparameter search by showing that existing space-filling designs only offer constant-factor improvements over random search, and introduces a new reshaping approach that achieves substantial gains, with empirical results demonstrating improved performance on expensive AI tasks.

Space-filling designs such as scrambled-Hammersley, Latin Hypercube Sampling and Jittered Sampling have been proposed for fully parallel hyperparameter search, and were shown to be more effective than random or grid search. In this paper, we show that these designs only improve over random search by a constant factor. In contrast, we introduce a new approach based on reshaping the search distribution, which leads to substantial gains over random search, both theoretically and empirically. We propose two flavors of reshaping. First, when the distribution of the optimum is some known $P_0$, we propose Recentering, which uses as search distribution a modified version of $P_0$ tightened closer to the center of the domain, in a dimension-dependent and budget-dependent manner. Second, we show that in a wide range of experiments with $P_0$ unknown, using a proposed Cauchy transformation, which simultaneously has a heavier tail (for unbounded hyperparameters) and is closer to the boundaries (for bounded hyperparameters), leads to improved performances. Besides artificial experiments and simple real world tests on clustering or Salmon mappings, we check our proposed methods on expensive artificial intelligence tasks such as attend/infer/repeat, video next frame segmentation forecasting and progressive generative adversarial networks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes