LG CV IVSep 5, 2024

Memory-Optimized Once-For-All Network

Maxime Girard, Victor Quétu, Samuel Tardieu, Van-Tam Nguyen, Enzo Tartaglione

arXiv:2409.05900v14.62 citationsh-index: 14Has Code

Originality Synthesis-oriented

AI Analysis

This is an incremental improvement for deploying neural networks on hardware with varying constraints.

The paper tackles the challenge of deploying deep neural networks on resource-limited devices by introducing a Memory-Optimized Once-For-All (MOOFA) supernet, which improves memory exploitation and model accuracy compared to the original OFA supernet on ImageNet.

Deploying Deep Neural Networks (DNNs) on different hardware platforms is challenging due to varying resource constraints. Besides handcrafted approaches aiming at making deep models hardware-friendly, Neural Architectures Search is rising as a toolbox to craft more efficient DNNs without sacrificing performance. Among these, the Once-For-All (OFA) approach offers a solution by allowing the sampling of well-performing sub-networks from a single supernet -- this leads to evident advantages in terms of computation. However, OFA does not fully utilize the potential memory capacity of the target device, focusing instead on limiting maximum memory usage per layer. This leaves room for an unexploited potential in terms of model generalizability. In this paper, we introduce a Memory-Optimized OFA (MOOFA) supernet, designed to enhance DNN deployment on resource-limited devices by maximizing memory usage (and for instance, features diversity) across different configurations. Tested on ImageNet, our MOOFA supernet demonstrates improvements in memory exploitation and model accuracy compared to the original OFA supernet. Our code is available at https://github.com/MaximeGirard/memory-optimized-once-for-all.

View on arXiv PDF Code

Similar