CVApr 17, 2020

Adaptive Neuron-wise Discriminant Criterion and Adaptive Center Loss at Hidden Layer for Deep Convolutional Neural Network

Motoshi Abe, Junichi Miyao, Takio Kurita

arXiv:2004.08074v11.2Has Code

Originality Incremental advance

AI Analysis

This work addresses feature discriminability in CNNs for image classification, offering an incremental improvement over existing methods.

The paper tackled the problem of non-discriminative deep features in CNNs by proposing an adaptive neuron-wise discriminant criterion at the output layer and adaptive center loss at a hidden layer, achieving improved classification accuracy on datasets like CIFAR10 and CIFAR100 with concrete gains such as a 0.5% increase on CIFAR10.

A deep convolutional neural network (CNN) has been widely used in image classification and gives better classification accuracy than the other techniques. The softmax cross-entropy loss function is often used for classification tasks. There are some works to introduce the additional terms in the objective function for training to make the features of the output layer more discriminative. The neuron-wise discriminant criterion makes the input feature of each neuron in the output layer discriminative by introducing the discriminant criterion to each of the features. Similarly, the center loss was introduced to the features before the softmax activation function for face recognition to make the deep features discriminative. The ReLU function is often used for the network as an active function in the hidden layers of the CNN. However, it is observed that the deep features trained by using the ReLU function are not discriminative enough and show elongated shapes. In this paper, we propose to use the neuron-wise discriminant criterion at the output layer and the center-loss at the hidden layer. Also, we introduce the online computation of the means of each class with the exponential forgetting. We named them adaptive neuron-wise discriminant criterion and adaptive center loss, respectively. The effectiveness of the integration of the adaptive neuron-wise discriminant criterion and the adaptive center loss is shown by the experiments with MNSIT, FashionMNIST, CIFAR10, CIFAR100, and STL10. Source code is at https://github.com/i13abe/Adaptive-discriminant-and-center

View on arXiv PDF Code

Similar