LGMLFeb 21, 2019

Evaluating the Search Phase of Neural Architecture Search

arXiv:1902.08142v3370 citations
AI Analysis

This work addresses a critical evaluation gap for researchers in automated machine learning, though it is incremental as it focuses on assessment rather than proposing new methods.

The paper tackles the problem of evaluating the search phase in Neural Architecture Search (NAS) by comparing state-of-the-art NAS algorithms to random architecture selection, finding that on average they perform similarly and that weight sharing degrades candidate rankings.

Neural Architecture Search (NAS) aims to facilitate the design of deep networks for new tasks. Existing techniques rely on two stages: searching over the architecture space and validating the best architecture. NAS algorithms are currently compared solely based on their results on the downstream task. While intuitive, this fails to explicitly evaluate the effectiveness of their search strategies. In this paper, we propose to evaluate the NAS search phase. To this end, we compare the quality of the solutions obtained by NAS search policies with that of random architecture selection. We find that: (i) On average, the state-of-the-art NAS algorithms perform similarly to the random policy; (ii) the widely-used weight sharing strategy degrades the ranking of the NAS candidates to the point of not reflecting their true performance, thus reducing the effectiveness of the search process. We believe that our evaluation framework will be key to designing NAS strategies that consistently discover architectures superior to random ones.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes