CVFeb 19, 2015

Unsupervised Network Pretraining via Encoding Human Design

arXiv:1502.05689v2
AI Analysis

This addresses the challenge of incorporating prior research knowledge into neural network training for computer vision, though it is incremental as it builds on existing feature extraction methods.

The paper tackles the problem of training deep neural networks for visual object recognition by pretraining them to replicate hand-designed feature extraction processes, such as histogram of oriented gradients and region covariance, and achieves substantially better performance than baseline methods after finetuning.

Over the years, computer vision researchers have spent an immense amount of effort on designing image features for the visual object recognition task. We propose to incorporate this valuable experience to guide the task of training deep neural networks. Our idea is to pretrain the network through the task of replicating the process of hand-designed feature extraction. By learning to replicate the process, the neural network integrates previous research knowledge and learns to model visual objects in a way similar to the hand-designed features. In the succeeding finetuning step, it further learns object-specific representations from labeled data and this boosts its classification power. We pretrain two convolutional neural networks where one replicates the process of histogram of oriented gradients feature extraction, and the other replicates the process of region covariance feature extraction. After finetuning, we achieve substantially better performance than the baseline methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes