CVAIJan 7, 2021

L2PF -- Learning to Prune Faster

arXiv:2101.02663v15 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of accelerating CNN compression for autonomous driving applications, where rapid deployment of new features is critical.

The paper introduces L2PF, a multi-task learning method for faster pruning of convolutional neural networks. It discretely learns redundant filters and continuously determines fine-tuning duration, achieving a 3.84x compression ratio on ResNet20 with minimal accuracy loss and reducing GPU hours by 1.71x compared to state-of-the-art pruning.

Various applications in the field of autonomous driving are based on convolutional neural networks (CNNs), especially for processing camera data. The optimization of such CNNs is a major challenge in continuous development. Newly learned features must be brought into vehicles as quickly as possible, and as such, it is not feasible to spend redundant GPU hours during compression. In this context, we present Learning to Prune Faster which details a multi-task, try-and-learn method, discretely learning redundant filters of the CNN and a continuous action of how long the layers have to be fine-tuned. This allows us to significantly speed up the convergence process of learning how to find an embedded-friendly filter-wise pruned CNN. For ResNet20, we have achieved a compression ratio of 3.84 x with minimal accuracy degradation. Compared to the state-of-the-art pruning method, we reduced the GPU hours by 1.71 x.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes