CVOct 28, 2018

Distilling Critical Paths in Convolutional Neural Networks

arXiv:1811.02643v28.724 citations

Originality Incremental advance

AI Analysis

This addresses resource constraints for deploying neural networks, but is incremental as it builds on existing compression techniques.

The paper tackled the problem of neural network compression by identifying that higher-layer filters in CNNs learn class-exclusive features, and proposed a critical path distillation method to reduce model size and computation.

Neural network compression and acceleration are widely demanded currently due to the resource constraints on most deployment targets. In this paper, through analyzing the filter activation, gradients, and visualizing the filters' functionality in convolutional neural networks, we show that the filters in higher layers learn extremely task-specific features, which are exclusive for only a small subset of the overall tasks, or even a single class. Based on such findings, we reveal the critical paths of information flow for different classes. And by their intrinsic property of exclusiveness, we propose a critical path distillation method, which can effectively customize the convolutional neural networks to small ones with much smaller model size and less computation.

View on arXiv PDF

Similar