CVJul 14, 2017

Generalizing the Convolution Operator in Convolutional Neural Networks

arXiv:1707.09864v143 citations
Originality Incremental advance
AI Analysis

This work addresses a foundational limitation in CNNs for machine vision, though it is incremental as it builds on existing convolution concepts.

The paper tackles the problem of generalizing the convolution operator in CNNs by proposing surrogate functions based on positive definite kernels and similarity measures, with experiments on MNIST showing that generalized CNNs using weighted L1/L2 distances achieve performance comparable to ordinary CNNs.

Convolutional neural networks have become a main tool for solving many machine vision and machine learning problems. A major element of these networks is the convolution operator which essentially computes the inner product between a weight vector and the vectorized image patches extracted by sliding a window in the image planes of the previous layer. In this paper, we propose two classes of surrogate functions for the inner product operation inherent in the convolution operator and so attain two generalizations of the convolution operator. The first one is the class of positive definite kernel functions where their application is justified by the kernel trick. The second one is the class of similarity measures defined based on a distance function. We justify this by tracing back to the basic idea behind the neocognitron which is the ancestor of CNNs. Both methods are then further generalized by allowing a monotonically increasing function to be applied subsequently. Like any trainable parameter in a neural network, the template pattern and the parameters of the kernel/distance function are trained with the back-propagation algorithm. As an aside, we use the proposed framework to justify the use of sine activation function in CNNs. Our experiments on the MNIST dataset show that the performance of ordinary CNNs can be achieved by generalized CNNs based on weighted L1/L2 distances, proving the applicability of the proposed generalization of the convolutional neural networks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes