LGCVMLJan 6, 2020

Identifying and Compensating for Feature Deviation in Imbalanced Deep Learning

arXiv:2001.01385v4121 citations
AI Analysis

This addresses the issue of poor classifier performance on minor classes in imbalanced data for deep learning practitioners, though it is incremental as it builds on existing methods for handling class imbalance.

The paper tackled the problem of class-imbalanced deep learning by identifying feature deviation in ConvNets, where test data of minor classes are pushed to low decision value regions, and proposed class-dependent temperatures (CDT) to compensate, achieving promising performance on benchmark datasets.

Classifiers trained with class-imbalanced data are known to perform poorly on test data of the "minor" classes, of which we have insufficient training data. In this paper, we investigate learning a ConvNet classifier under such a scenario. We found that a ConvNet significantly over-fits the minor classes, which is quite opposite to traditional machine learning algorithms that often under-fit minor classes. We conducted a series of analysis and discovered the feature deviation phenomenon -- the learned ConvNet generates deviated features between the training and test data of minor classes -- which explains how over-fitting happens. To compensate for the effect of feature deviation which pushes test data toward low decision value regions, we propose to incorporate class-dependent temperatures (CDT) in training a ConvNet. CDT simulates feature deviation in the training phase, forcing the ConvNet to enlarge the decision values for minor-class data so that it can overcome real feature deviation in the test phase. We validate our approach on benchmark datasets and achieve promising performance. We hope that our insights can inspire new ways of thinking in resolving class-imbalanced deep learning.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes