CVLGJun 30, 2012

Differentiable Pooling for Hierarchical Feature Learning

arXiv:1207.0151v116 citations
Originality Incremental advance
AI Analysis

This addresses the issue of suboptimal pooling in hierarchical feature learning for computer vision, though it is incremental as it builds on existing deconvolutional network models.

The paper tackles the problem of heuristic-based pooling in hierarchical models by introducing a parametric Gaussian pooling method that can be optimized jointly with features in a global objective, resulting in a what/where decomposition of input signals and demonstrated improvements on MNIST digits.

We introduce a parametric form of pooling, based on a Gaussian, which can be optimized alongside the features in a single global objective function. By contrast, existing pooling schemes are based on heuristics (e.g. local maximum) and have no clear link to the cost function of the model. Furthermore, the variables of the Gaussian explicitly store location information, distinct from the appearance captured by the features, thus providing a what/where decomposition of the input signal. Although the differentiable pooling scheme can be incorporated in a wide range of hierarchical models, we demonstrate it in the context of a Deconvolutional Network model (Zeiler et al. ICCV 2011). We also explore a number of secondary issues within this model and present detailed experiments on MNIST digits.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes