LGNov 16, 2020

Redesigning the classification layer by randomizing the class representation vectors

arXiv:2011.08704v24 citations
AI Analysis

This addresses an incremental improvement in neural network training dynamics for image classification tasks.

The paper tackles the problem of visual similarities between classes in neural image classification models, which can deteriorate accuracy or prevent convergence, by proposing to randomize and fix class representation vectors during training. The result shows increased inter-class separability, intra-class compactness, and overall model accuracy, while maintaining robustness to image corruptions and generalization.

Neural image classification models typically consist of two components. The first is an image encoder, which is responsible for encoding a given raw image into a representative vector. The second is the classification component, which is often implemented by projecting the representative vector onto target class vectors. The target class vectors, along with the rest of the model parameters, are estimated so as to minimize the loss function. In this paper, we analyze how simple design choices for the classification layer affect the learning dynamics. We show that the standard cross-entropy training implicitly captures visual similarities between different classes, which might deteriorate accuracy or even prevents some models from converging. We propose to draw the class vectors randomly and set them as fixed during training, thus invalidating the visual similarities encoded in these vectors. We analyze the effects of keeping the class vectors fixed and show that it can increase the inter-class separability, intra-class compactness, and the overall model accuracy, while maintaining the robustness to image corruptions and the generalization of the learned concepts.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes