CV LG NEFeb 9, 2016

The Role of Typicality in Object Classification: Improving The Generalization Capacity of Convolutional Neural Networks

Babak Saleh, Ahmed Elgammal, Jacob Feldman

arXiv:1602.02865v12.11 citations

Originality Incremental advance

AI Analysis

This addresses a generalization issue in computer vision for improving CNN robustness, but it is incremental as it builds on existing models.

The paper tackles the problem of convolutional neural networks failing to generalize to atypical images, and shows that incorporating a typicality measure improves classification results on new image sets by a large margin without fine-tuning.

Deep artificial neural networks have made remarkable progress in different tasks in the field of computer vision. However, the empirical analysis of these models and investigation of their failure cases has received attention recently. In this work, we show that deep learning models cannot generalize to atypical images that are substantially different from training images. This is in contrast to the superior generalization ability of the visual system in the human brain. We focus on Convolutional Neural Networks (CNN) as the state-of-the-art models in object recognition and classification; investigate this problem in more detail, and hypothesize that training CNN models suffer from unstructured loss minimization. We propose computational models to improve the generalization capacity of CNNs by considering how typical a training image looks like. By conducting an extensive set of experiments we show that involving a typicality measure can improve the classification results on a new set of images by a large margin. More importantly, this significant improvement is achieved without fine-tuning the CNN model on the target image set.

View on arXiv PDF

Similar