A Novel Weight-Shared Multi-Stage CNN for Scale Robustness
This addresses a key limitation in CNNs for image classification, offering a practical solution to improve scale robustness, though it is incremental as it builds on existing deep CNNs.
The paper tackles the limited robustness of CNNs to object scaling by proposing a weight-shared multi-stage network (WSMS-Net) that integrates with existing architectures like ResNet and DenseNet, resulting in higher accuracies on CIFAR-10, CIFAR-100, and ImageNet datasets with minimal increases in parameters and computation time.
Convolutional neural networks (CNNs) have demonstrated remarkable results in image classification for benchmark tasks and practical applications. The CNNs with deeper architectures have achieved even higher performance recently thanks to their robustness to the parallel shift of objects in images as well as their numerous parameters and the resulting high expression ability. However, CNNs have a limited robustness to other geometric transformations such as scaling and rotation. This limits the performance improvement of the deep CNNs, but there is no established solution. This study focuses on scale transformation and proposes a network architecture called the weight-shared multi-stage network (WSMS-Net), which consists of multiple stages of CNNs. The proposed WSMS-Net is easily combined with existing deep CNNs such as ResNet and DenseNet and enables them to acquire robustness to object scaling. Experimental results on the CIFAR-10, CIFAR-100, and ImageNet datasets demonstrate that existing deep CNNs combined with the proposed WSMS-Net achieve higher accuracies for image classification tasks with only a minor increase in the number of parameters and computation time.