LG MLOct 21, 2019

Building Efficient CNNs Using Depthwise Convolutional Eigen-Filters (DeCEF)

Yinan Yu, Samuel Scheidegger, Tomas McKelvey

arXiv:1910.09359v31.83 citations

Originality Incremental advance

AI Analysis

This addresses the need for efficient CNNs in applications where pre-trained models are unavailable, offering a domain-specific compression method.

The paper tackles the problem of reducing CNN complexity without needing pre-trained models by introducing Depthwise Convolutional Eigen-Filters (DeCEF), a low-rank alternative to Conv2D layers, achieving similar or higher accuracy on CIFAR-10 and ImageNet with about 2/3 of the original parameters and FLOPs.

Deep Convolutional Neural Networks (CNNs) have been widely used in various domains due to their impressive capabilities. These models are typically composed of a large number of 2D convolutional (Conv2D) layers with numerous trainable parameters. To reduce the complexity of a network, compression techniques can be applied. These methods typically rely on the analysis of trained deep learning models. However, in some applications, due to reasons such as particular data or system specifications and licensing restrictions, a pre-trained network may not be available. This would require the user to train a CNN from scratch. In this paper, we aim to find an alternative parameterization to Conv2D filters without relying on a pre-trained convolutional network. During the analysis, we observe that the effective rank of the vectorized Conv2D filters decreases with respect to the increasing depth in the network, which then leads to the implementation of the Depthwise Convolutional Eigen-Filter (DeCEF) layer. Essentially, a DeCEF layer is a low rank version of the Conv2D layer with significantly fewer trainable parameters and floating point operations (FLOPs). The way we define the effective rank is different from the previous work and it is easy to implement in any deep learning frameworks. To evaluate the effectiveness of DeCEF, experiments are conducted on the benchmark datasets CIFAR-10 and ImageNet using various network architectures. The results have shown a similar or higher accuracy and robustness using about 2/3 of the original parameters and reducing the number of FLOPs to 2/3 of the base network, which is then compared to the state-of-the-art techniques.

View on arXiv PDF

Similar