LG CL ML Jun 28, 2019

FIESTA: Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms

Henry B. Moss, Andrew Moore, David S. Leslie, Paul Rayson

arXiv:1906.12230v151.21093 citations h-index: 58 Has Code

Originality: Incremental advance

AI Analysis

This work addresses inefficient and unreliable model selection in machine learning, particularly in shared tasks. The advance is incremental, as it builds on existing bandit theory.

The paper tackles the problem of reliably identifying state-of-the-art models from large collections of candidates while reducing the computational resources required. It shows that FIESTA can select between 8 target-dependent sentiment analysis methods using dramatically fewer model evaluations than current model selection approaches.

We present FIESTA, a model selection approach that significantly reduces the computational resources required to reliably identify state-of-the-art performance from large collections of candidate models. Despite being known to produce unreliable comparisons, it is still common practice to compare model evaluations based on single choices of random seeds. We show that reliable model selection also requires evaluations based on multiple train-test splits (contrary to common practice in many shared tasks). Using bandit theory from the statistics literature, we are able to adaptively determine appropriate numbers of data splits and random seeds used to evaluate each model, focusing computational resources on the evaluation of promising models whilst avoiding wasting evaluations on models with lower performance. Furthermore, our user-friendly Python implementation produces confidence guarantees of correctly selecting the optimal model. We evaluate our algorithms by selecting between 8 target-dependent sentiment analysis methods using dramatically fewer model evaluations than current model selection approaches.
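The adaptive allocation described in the abstract can be illustrated with a generic fixed-budget best-arm identification routine. The sketch below uses sequential halving as a stand-in; the abstract does not say which bandit algorithm FIESTA uses, and this is not the API of the authors' Python implementation. The names `sequential_halving`, `simulated_evaluate`, and `TRUE_MEANS` are hypothetical, and the scores are simulated rather than taken from the paper. Each call to `evaluate(i)` is assumed to train and test model `i` on a fresh train-test split with a fresh random seed.

```python
import math
import random
from statistics import mean


def sequential_halving(evaluate, n_models, budget):
    """Pick the best of n_models using at most `budget` noisy evaluations.

    evaluate(i) is assumed to return one score for model i, obtained from
    a fresh train-test split and random seed (higher is better).
    Returns (index of the selected model, number of evaluations used).
    """
    survivors = list(range(n_models))
    scores = {i: [] for i in survivors}
    rounds = math.ceil(math.log2(n_models)) if n_models > 1 else 0
    used = 0
    for _ in range(rounds):
        # Split this round's share of the budget evenly over the survivors.
        per_model = max(1, budget // (len(survivors) * rounds))
        for i in survivors:
            for _ in range(per_model):
                scores[i].append(evaluate(i))
                used += 1
        # Keep the better half, ranked by mean score over all evaluations so far.
        survivors.sort(key=lambda i: mean(scores[i]), reverse=True)
        survivors = survivors[: max(1, math.ceil(len(survivors) / 2))]
    return survivors[0], used


# Simulated candidates: hypothetical true mean scores for 8 models.
TRUE_MEANS = [0.50, 0.52, 0.55, 0.58, 0.60, 0.61, 0.63, 0.80]
_rng = random.Random(0)


def simulated_evaluate(i):
    # Gaussian noise stands in for split-to-split and seed-to-seed variation.
    return _rng.gauss(TRUE_MEANS[i], 0.02)


best, used = sequential_halving(simulated_evaluate, len(TRUE_MEANS), budget=120)
print(f"selected model {best} after {used} evaluations")
```

With 8 candidates and a budget of 120, the routine runs three rounds of 40 evaluations each: 5 per model across all 8, then 10 per model across the surviving 4, then 20 per model across the final 2. Weak models therefore stop consuming evaluations early, which is the behaviour the abstract describes. Unlike FIESTA as described, this sketch returns no confidence guarantee.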
