CVDec 23, 2018

Leveraging Class Similarity to Improve Deep Neural Network Robustness

arXiv:1812.09744v2
Originality Incremental advance
AI Analysis

This addresses the issue of misclassification penalties in neural networks for researchers and practitioners, but it is incremental as it modifies an existing loss function.

The paper tackled the problem of penalizing neural networks equally for all misclassifications by introducing a cross-entropy loss variation that incorporates a data-driven class-similarity distribution, resulting in slightly better generalization and improved performance on noisy testing scenarios across multiple architectures and datasets.

Traditionally artificial neural networks (ANNs) are trained by minimizing the cross-entropy between a provided groundtruth delta distribution (encoded as one-hot vector) and the ANN's predictive softmax distribution. It seems, however, unacceptable to penalize networks equally for missclassification between classes. Confusing the class "Automobile" with the class "Truck" should be penalized less than confusing the class "Automobile" with the class "Donkey". To avoid such representation issues and learn cleaner classification boundaries in the network, this paper presents a variation of cross-entropy loss which depends not only on the sample class but also on a data-driven prior "class-similarity distribution" across the classes encoded in a matrix form. We explore learning the class-similarity distribution using a datadriven method and then show that by training with our modified similarity-driven loss, we obtain slightly better generalization performance over multiple architectures and datasets as well as improved performance on noisy testing scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes