LGCVMLMay 6, 2019

Differentiable Architecture Search with Ensemble Gumbel-Softmax

arXiv:1905.01786v119 citations
Originality Incremental advance
AI Analysis

This work addresses the efficiency bottleneck in neural architecture search for machine learning practitioners, though it appears incremental as it builds on existing differentiable NAS approaches.

The paper tackles the challenge of balancing effectiveness and efficiency in neural architecture search by developing a differentiable NAS method that uses an ensemble Gumbel-Softmax estimator to optimize both architecture and parameters end-to-end, achieving high-performance architectures with guaranteed efficiency across multiple datasets.

For network architecture search (NAS), it is crucial but challenging to simultaneously guarantee both effectiveness and efficiency. Towards achieving this goal, we develop a differentiable NAS solution, where the search space includes arbitrary feed-forward network consisting of the predefined number of connections. Benefiting from a proposed ensemble Gumbel-Softmax estimator, our method optimizes both the architecture of a deep network and its parameters in the same round of backward propagation, yielding an end-to-end mechanism of searching network architectures. Extensive experiments on a variety of popular datasets strongly evidence that our method is capable of discovering high-performance architectures, while guaranteeing the requisite efficiency during searching.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes