LGMLDec 23, 2021

Using Sequential Statistical Tests for Efficient Hyperparameter Tuning

arXiv:2112.12438v23 citations
Originality Incremental advance
AI Analysis

This addresses the time-consuming nature of hyperparameter tuning for machine learning practitioners, though it is incremental as it builds on existing random search methods.

The paper tackled the problem of inefficient hyperparameter tuning by proposing Sequential Random Search (SQRS), which uses sequential statistical tests to eliminate inferior configurations early, resulting in noticeably fewer evaluations while finding similarly well-performing settings.

Hyperparameter tuning is one of the the most time-consuming parts in machine learning. Despite the existence of modern optimization algorithms that minimize the number of evaluations needed, evaluations of a single setting may still be expensive. Usually a resampling technique is used, where the machine learning method has to be fitted a fixed number of k times on different training datasets. The respective mean performance of the k fits is then used as performance estimator. Many hyperparameter settings could be discarded after less than k resampling iterations if they are clearly inferior to high-performing settings. However, resampling is often performed until the very end, wasting a lot of computational effort. To this end, we propose the Sequential Random Search (SQRS) which extends the regular random search algorithm by a sequential testing procedure aimed at detecting and eliminating inferior parameter configurations early. We compared our SQRS with regular random search using multiple publicly available regression and classification datasets. Our simulation study showed that the SQRS is able to find similarly well-performing parameter settings while requiring noticeably fewer evaluations. Our results underscore the potential for integrating sequential tests into hyperparameter tuning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes