CVApr 24, 2016

Rotation-Invariant Restricted Boltzmann Machine Using Shared Gradient Filters

arXiv:1604.07045v22 citations
AI Analysis

This is an incremental improvement for computer vision tasks where rotation invariance is crucial, such as image recognition.

The paper tackled the problem of Restricted Boltzmann Machines (RBMs) lacking invariance to transformations like rotations by proposing an Explicit Rotation-Invariant RBM that uses dominant image orientations to learn rotation-invariant features. The result showed that the method learns compact features robust to rotations while requiring fewer hidden units, as demonstrated on the MNIST benchmark dataset.

Finding suitable features has been an essential problem in computer vision. We focus on Restricted Boltzmann Machines (RBMs), which, despite their versatility, cannot accommodate transformations that may occur in the scene. As a result, several approaches have been proposed that consider a set of transformations, which are used to either augment the training set or transform the actual learned filters. In this paper, we propose the Explicit Rotation-Invariant Restricted Boltzmann Machine, which exploits prior information coming from the dominant orientation of images. Our model extends the standard RBM, by adding a suitable number of weight matrices, associated with each dominant gradient. We show that our approach is able to learn rotation-invariant features, comparing it with the classic formulation of RBM on the MNIST benchmark dataset. Overall, requiring less hidden units, our method learns compact features, which are robust to rotations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes