CVAIMay 9, 2023

GPT-NAS: Evolutionary Neural Architecture Search with the Generative Pre-Trained Model

arXiv:2305.05351v410 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of inefficient architecture search in machine learning, offering a novel hybrid method that is incremental but shows strong gains in specific domains like neural network design.

The paper tackles the inefficiency of Neural Architecture Search (NAS) due to huge search spaces by proposing GPT-NAS, which uses a Generative Pre-Trained model with an evolutionary algorithm to reduce the search space and improve performance. It shows significant outperformance over manually designed and competing NAS architectures, with up to a 12% improvement in finely tuned neural architectures.

Neural Architecture Search (NAS) has emerged as one of the effective methods to design the optimal neural network architecture automatically. Although neural architectures have achieved human-level performances in several tasks, few of them are obtained from the NAS method. The main reason is the huge search space of neural architectures, making NAS algorithms inefficient. This work presents a novel architecture search algorithm, called GPT-NAS, that optimizes neural architectures by Generative Pre-Trained (GPT) model with an evolutionary algorithm (EA) as the search strategy. In GPT-NAS, we assume that a generative model pre-trained on a large-scale corpus could learn the fundamental law of building neural architectures. Therefore, GPT-NAS leverages the GPT model to propose reasonable architecture components given the basic one and then utilizes EAs to search for the optimal solution. Such an approach can largely reduce the search space by introducing prior knowledge in the search process. Extensive experimental results show that our GPT-NAS method significantly outperforms seven manually designed neural architectures and thirteen architectures provided by competing NAS methods. In addition, our experiments also indicate that the proposed algorithm improves the performance of finely tuned neural architectures by up to about 12% compared to those without GPT, further demonstrating its effectiveness in searching neural architectures.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes