CVFeb 7, 2025

Training-free Neural Architecture Search through Variance of Knowledge of Deep Network Weights

arXiv:2502.04975v12 citationsh-index: 1Has CodeCVPR
Originality Highly original
AI Analysis

This addresses the computational bottleneck in NAS for researchers and practitioners, though it is an incremental improvement over existing training-free methods.

The paper tackles the high computational cost of Neural Architecture Search (NAS) by proposing a training-free proxy based on Fisher Information to estimate image classification accuracy without training networks, achieving state-of-the-art results on three public datasets and two search spaces.

Deep learning has revolutionized computer vision, but it achieved its tremendous success using deep network architectures which are mostly hand-crafted and therefore likely suboptimal. Neural Architecture Search (NAS) aims to bridge this gap by following a well-defined optimization paradigm which systematically looks for the best architecture, given objective criterion such as maximal classification accuracy. The main limitation of NAS is however its astronomical computational cost, as it typically requires training each candidate network architecture from scratch. In this paper, we aim to alleviate this limitation by proposing a novel training-free proxy for image classification accuracy based on Fisher Information. The proposed proxy has a strong theoretical background in statistics and it allows estimating expected image classification accuracy of a given deep network without training the network, thus significantly reducing computational cost of standard NAS algorithms. Our training-free proxy achieves state-of-the-art results on three public datasets and in two search spaces, both when evaluated using previously proposed metrics, as well as using a new metric that we propose which we demonstrate is more informative for practical NAS applications. The source code is publicly available at http://www.github.com/ondratybl/VKDNW

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes