LG MLJun 1, 2018

TAPAS: Train-less Accuracy Predictor for Architecture Search

R. Istrate, F. Scheidegger, G. Mariani, D. Nikolopoulos, C. Bekas, A. C. I. Malossi

arXiv:1806.00250v117.978 citations

Originality Incremental advance

AI Analysis

This enables fast and resource-efficient architecture search for researchers and practitioners, though it is incremental as it builds on existing predictor methods.

The paper tackles the problem of neural architecture search requiring extensive training by proposing a train-less accuracy predictor that estimates classification performance for unseen datasets in fractions of a second, achieving 93.67% accuracy on CIFAR-10 and 81.01% on CIFAR-100 with searches completed in 400 seconds on a single GPU.

In recent years an increasing number of researchers and practitioners have been suggesting algorithms for large-scale neural network architecture search: genetic algorithms, reinforcement learning, learning curve extrapolation, and accuracy predictors. None of them, however, demonstrated high-performance without training new experiments in the presence of unseen datasets. We propose a new deep neural network accuracy predictor, that estimates in fractions of a second classification performance for unseen input datasets, without training. In contrast to previously proposed approaches, our prediction is not only calibrated on the topological network information, but also on the characterization of the dataset-difficulty which allows us to re-tune the prediction without any training. Our predictor achieves a performance which exceeds 100 networks per second on a single GPU, thus creating the opportunity to perform large-scale architecture search within a few minutes. We present results of two searches performed in 400 seconds on a single GPU. Our best discovered networks reach 93.67% accuracy for CIFAR-10 and 81.01% for CIFAR-100, verified by training. These networks are performance competitive with other automatically discovered state-of-the-art networks however we only needed a small fraction of the time to solution and computational resources.

View on arXiv PDF

Similar