CVLGNEFeb 9, 2016

The Role of Typicality in Object Classification: Improving The Generalization Capacity of Convolutional Neural Networks

arXiv:1602.02865v11 citations
Originality Incremental advance
AI Analysis

This addresses a generalization issue in computer vision for improving CNN robustness, but it is incremental as it builds on existing models.

The paper tackles the problem of convolutional neural networks failing to generalize to atypical images, and shows that incorporating a typicality measure improves classification results on new image sets by a large margin without fine-tuning.

Deep artificial neural networks have made remarkable progress in different tasks in the field of computer vision. However, the empirical analysis of these models and investigation of their failure cases has received attention recently. In this work, we show that deep learning models cannot generalize to atypical images that are substantially different from training images. This is in contrast to the superior generalization ability of the visual system in the human brain. We focus on Convolutional Neural Networks (CNN) as the state-of-the-art models in object recognition and classification; investigate this problem in more detail, and hypothesize that training CNN models suffer from unstructured loss minimization. We propose computational models to improve the generalization capacity of CNNs by considering how typical a training image looks like. By conducting an extensive set of experiments we show that involving a typicality measure can improve the classification results on a new set of images by a large margin. More importantly, this significant improvement is achieved without fine-tuning the CNN model on the target image set.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes