CVLGMar 30, 2020

Designing Network Design Spaces

arXiv:2003.13678v12129 citations
AI Analysis

This work addresses the challenge of discovering generalizable network design principles for machine learning practitioners, offering a novel approach that is not incremental.

The authors tackled the problem of network design by proposing a new paradigm that designs network design spaces instead of individual instances, resulting in RegNet models that outperform EfficientNet by up to 5x faster on GPUs under comparable settings.

In this work, we present a new network design paradigm. Our goal is to help advance the understanding of network design and discover design principles that generalize across settings. Instead of focusing on designing individual network instances, we design network design spaces that parametrize populations of networks. The overall process is analogous to classic manual design of networks, but elevated to the design space level. Using our methodology we explore the structure aspect of network design and arrive at a low-dimensional design space consisting of simple, regular networks that we call RegNet. The core insight of the RegNet parametrization is surprisingly simple: widths and depths of good networks can be explained by a quantized linear function. We analyze the RegNet design space and arrive at interesting findings that do not match the current practice of network design. The RegNet design space provides simple and fast networks that work well across a wide range of flop regimes. Under comparable training settings and flops, the RegNet models outperform the popular EfficientNet models while being up to 5x faster on GPUs.

Code Implementations26 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes