LGJan 4, 2022

AutoBalance: Optimized Loss Functions for Imbalanced Data

Mingchen Li, Xuechen Zhang, Christos Thrampoulidis, Jiasi Chen, Samet Oymak

arXiv:2201.01212v124.087 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses fairness and generalization issues in imbalanced classification for machine learning practitioners, representing an incremental improvement through automated loss design.

The authors tackled the problem of poor generalization and fairness in imbalanced datasets by proposing AutoBalance, a bi-level optimization framework that automatically designs a loss function to optimize accuracy and fairness, achieving benefits over state-of-the-art approaches in empirical evaluations.

Imbalanced datasets are commonplace in modern machine learning problems. The presence of under-represented classes or groups with sensitive attributes results in concerns about generalization and fairness. Such concerns are further exacerbated by the fact that large capacity deep nets can perfectly fit the training data and appear to achieve perfect accuracy and fairness during training, but perform poorly during test. To address these challenges, we propose AutoBalance, a bi-level optimization framework that automatically designs a training loss function to optimize a blend of accuracy and fairness-seeking objectives. Specifically, a lower-level problem trains the model weights, and an upper-level problem tunes the loss function by monitoring and optimizing the desired objective over the validation data. Our loss design enables personalized treatment for classes/groups by employing a parametric cross-entropy loss and individualized data augmentation schemes. We evaluate the benefits and performance of our approach for the application scenarios of imbalanced and group-sensitive classification. Extensive empirical evaluations demonstrate the benefits of AutoBalance over state-of-the-art approaches. Our experimental findings are complemented with theoretical insights on loss function design and the benefits of train-validation split. All code is available open-source.

View on arXiv PDF Code

Similar