CVNov 27, 2019

CSPNet: A New Backbone that can Enhance Learning Capability of CNN

Chien-Yao Wang, Hong-Yuan Mark Liao, I-Hau Yeh, Yueh-Hua Wu, Ping-Yang Chen, Jun-Wei Hsieh

arXiv:1911.11929v137.64143 citationsHas Code

Originality Highly original

AI Analysis

This addresses the issue of resource-intensive inference for users with limited hardware, offering a practical improvement for deploying advanced vision models.

The paper tackles the problem of high computational cost in CNNs for computer vision by proposing CSPNet, which reduces computations by 20% on ImageNet while maintaining or improving accuracy and outperforming state-of-the-art methods on MS COCO object detection.

Neural networks have enabled state-of-the-art approaches to achieve incredible results on computer vision tasks such as object detection. However, such success greatly relies on costly computation resources, which hinders people with cheap devices from appreciating the advanced technology. In this paper, we propose Cross Stage Partial Network (CSPNet) to mitigate the problem that previous works require heavy inference computations from the network architecture perspective. We attribute the problem to the duplicate gradient information within network optimization. The proposed networks respect the variability of the gradients by integrating feature maps from the beginning and the end of a network stage, which, in our experiments, reduces computations by 20% with equivalent or even superior accuracy on the ImageNet dataset, and significantly outperforms state-of-the-art approaches in terms of AP50 on the MS COCO object detection dataset. The CSPNet is easy to implement and general enough to cope with architectures based on ResNet, ResNeXt, and DenseNet. Source code is at https://github.com/WongKinYiu/CrossStagePartialNetworks.

View on arXiv PDF Code

Similar