LGMLDec 23, 2019

AutoML: Exploration v.s. Exploitation

arXiv:1912.10746v21 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the time and resource efficiency of AutoML systems for practitioners, but the findings are incremental as they confirm existing exploration-exploitation trade-offs without new methods.

The paper empirically tests whether focusing on the most promising classifiers improves AutoML pipeline performance under time constraints, finding that exploiting top classifiers does not yield statistically better results than exploring the entire search space, even with longer time budgets.

Building a machine learning (ML) pipeline in an automated way is a crucial and complex task as it is constrained with the available time budget and resources. This encouraged the research community to introduce several solutions to utilize the available time and resources. A lot of work is done to suggest the most promising classifiers for a given dataset using sundry of techniques including meta-learning based techniques. This gives the autoML framework the chance to spend more time exploiting those classifiers and tuning their hyper-parameters. In this paper, we empirically study the hypothesis of improving the pipeline performance by exploiting the most promising classifiers within the limited time budget. We also study the effect of increasing the time budget over the pipeline performance. The empirical results across autoSKLearn, TPOT and ATM, show that exploiting the most promising classifiers does not achieve a statistically better performance than exploring the entire search space. The same conclusion is also applied for long time budgets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes