Maksym Kholiavchenko

9.1LGMar 24, 2019Code

MUSCO: Multi-Stage Compression of neural networks

Julia Gusak, Maksym Kholiavchenko, Evgeny Ponomarev et al.

The low-rank tensor approximation is very promising for the compression of deep neural networks. We propose a new simple and efficient iterative approach, which alternates low-rank factorization with a smart rank selection and fine-tuning. We demonstrate the efficiency of our method comparing to non-iterative ones. Our approach improves the compression rate while maintaining the accuracy for a variety of tasks.

3.3CVMar 23, 2018

Iterative Low-Rank Approximation for CNN Compression

Maksym Kholiavchenko

Deep convolutional neural networks contain tens of millions of parameters, making them impossible to work efficiently on embedded devices. We propose iterative approach of applying low-rank approximation to compress deep convolutional neural networks. Since classification and object detection are the most favored tasks for embedded devices, we demonstrate the effectiveness of our approach by compressing AlexNet, VGG-16, YOLOv2 and Tiny YOLO networks. Our results show the superiority of the proposed method compared to non-repetitive ones. We demonstrate higher compression ratio providing less accuracy loss.

Maksym Kholiavchenko

2 Papers