LGMay 26, 2023

DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models

arXiv:2305.16943v421 citationsHas Code
Originality Highly original
AI Analysis

This work addresses the computational bottleneck in neural architecture search for machine learning practitioners, offering a more efficient paradigm shift from NAS to generation.

The paper tackles the inefficiency of existing Neural Architecture Search (NAS) methods by proposing DiffusionNAG, a conditional Neural Architecture Generation framework using diffusion models, which achieves up to 35 times speedup on Transferable NAS benchmarks and outperforms Bayesian Optimization-based NAS approaches on ImageNet.

Existing NAS methods suffer from either an excessive amount of time for repetitive sampling and training of many task-irrelevant architectures. To tackle such limitations of existing NAS methods, we propose a paradigm shift from NAS to a novel conditional Neural Architecture Generation (NAG) framework based on diffusion models, dubbed DiffusionNAG. Specifically, we consider the neural architectures as directed graphs and propose a graph diffusion model for generating them. Moreover, with the guidance of parameterized predictors, DiffusionNAG can flexibly generate task-optimal architectures with the desired properties for diverse tasks, by sampling from a region that is more likely to satisfy the properties. This conditional NAG scheme is significantly more efficient than previous NAS schemes which sample the architectures and filter them using the property predictors. We validate the effectiveness of DiffusionNAG through extensive experiments in two predictor-based NAS scenarios: Transferable NAS and Bayesian Optimization (BO)-based NAS. DiffusionNAG achieves superior performance with speedups of up to 35 times when compared to the baselines on Transferable NAS benchmarks. Furthermore, when integrated into a BO-based algorithm, DiffusionNAG outperforms existing BO-based NAS approaches, particularly in the large MobileNetV3 search space on the ImageNet 1K dataset. Code is available at https://github.com/CownowAn/DiffusionNAG.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes