CVLGDec 6, 2020

Any-Width Networks

arXiv:2012.03153v19 citations
AI Analysis

This work addresses the challenge of deploying CNNs in resource-constrained practical applications where computational budgets and performance needs vary, offering a solution for practitioners needing adaptable models.

The paper proposes Any-Width Networks (AWNs), an adjustable-width CNN architecture that allows fine-grained control over speed and accuracy during inference. This is achieved through lower-triangular weight matrices that handle width-varying batch statistics and facilitate efficient training via random width sampling.

Despite remarkable improvements in speed and accuracy, convolutional neural networks (CNNs) still typically operate as monolithic entities at inference time. This poses a challenge for resource-constrained practical applications, where both computational budgets and performance needs can vary with the situation. To address these constraints, we propose the Any-Width Network (AWN), an adjustable-width CNN architecture and associated training routine that allow for fine-grained control over speed and accuracy during inference. Our key innovation is the use of lower-triangular weight matrices which explicitly address width-varying batch statistics while being naturally suited for multi-width operations. We also show that this design facilitates an efficient training routine based on random width sampling. We empirically demonstrate that our proposed AWNs compare favorably to existing methods while providing maximally granular control during inference.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes