CVJan 1, 2023

GoogLe2Net: Going Transverse with Convolutions

arXiv:2301.00424v11.52 citationsh-index: 9

Originality Incremental advance

AI Analysis

This work addresses image classification for computer vision researchers, presenting an incremental improvement by combining existing ideas like residual connections and multi-scale features.

The paper tackles the problem of effectively capturing feature information in vision tasks by proposing GoogLe2Net, a novel CNN architecture with residual feature-reutilization inceptions that improve image classification performance, achieving results such as 97.94% on CIFAR10 and 85.91% on CIFAR100.

Capturing feature information effectively is of great importance in vision tasks. With the development of convolutional neural networks (CNNs), concepts like residual connection and multiple scales promote continual performance gains on diverse deep learning vision tasks. However, the existing methods do not organically combined advantages of these valid ideas. In this paper, we propose a novel CNN architecture called GoogLe2Net, it consists of residual feature-reutilization inceptions (ResFRI) or split residual feature-reutilization inceptions (Split-ResFRI) which create transverse passages between adjacent groups of convolutional layers to enable features flow to latter processing branches and possess residual connections to better process information. Our GoogLe2Net is able to reutilize information captured by foregoing groups of convolutional layers and express multi-scale features at a fine-grained level, which improves performances in image classification. And the inception we proposed could be embedded into inception-like networks directly without any migration costs. Moreover, in experiments based on popular vision datasets, such as CIFAR10 (97.94%), CIFAR100 (85.91%) and Tiny Imagenet (70.54%), we obtain better results on image classification task compared with other modern models.

View on arXiv PDF

Similar