NELGOct 3, 2014

SimNets: A Generalization of Convolutional Networks

arXiv:1410.0781v317 citations
Originality Incremental advance
AI Analysis

This provides a more efficient architecture for deep learning practitioners, though it appears incremental as a generalization of existing ConvNets.

The authors tackled the problem of generalizing convolutional neural networks (ConvNets) by introducing SimNets, a deep layered architecture that achieves state-of-the-art accuracy with networks an order of magnitude smaller than comparable ConvNets.

We present a deep layered architecture that generalizes classical convolutional neural networks (ConvNets). The architecture, called SimNets, is driven by two operators, one being a similarity function whose family contains the convolution operator used in ConvNets, and the other is a new soft max-min-mean operator called MEX that realizes classical operators like ReLU and max pooling, but has additional capabilities that make SimNets a powerful generalization of ConvNets. Three interesting properties emerge from the architecture: (i) the basic input to hidden layer to output machinery contains as special cases kernel machines with the Exponential and Generalized Gaussian kernels, the output units being "neurons in feature space" (ii) in its general form, the basic machinery has a higher abstraction level than kernel machines, and (iii) initializing networks using unsupervised learning is natural. Experiments demonstrate the capability of achieving state of the art accuracy with networks that are an order of magnitude smaller than comparable ConvNets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes