LGMLNov 8, 2018

Alpha-Integration Pooling for Convolutional Neural Networks

arXiv:1811.03436v45 citations
AI Analysis

This addresses the need for data-driven pooling in CNNs for image recognition, but it is incremental as it builds on existing pooling concepts.

The paper tackles the problem of selecting optimal pooling types in convolutional neural networks for image recognition by proposing α-integration pooling, a trainable method that includes max-pooling and arithmetic average-pooling as special cases, and experiments show it outperforms other pooling methods.

Convolutional neural networks (CNNs) have achieved remarkable performance in many applications, especially in image recognition tasks. As a crucial component of CNNs, sub-sampling plays an important role for efficient training or invariance property, and max-pooling and arithmetic average-pooling are commonly used sub-sampling methods. In addition to the two pooling methods, however, there could be many other pooling types, such as geometric average, harmonic average, and so on. Since it is not easy for algorithms to find the best pooling method, usually the pooling types are assumed a priority, which might not be optimal for different tasks. In line with the deep learning philosophy, the type of pooling can be driven by data for a given task. In this paper, we propose {\it $α$-integration pooling} ($α$I-pooling), which has a trainable parameter $α$ to find the type of pooling. $α$I-pooling is a general pooling method including max-pooling and arithmetic average-pooling as a special case, depending on the parameter $α$. Experiments show that $α$I-pooling outperforms other pooling methods including max-pooling, in image recognition tasks. Also, it turns out that each layer has different optimal pooling type.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes