LG MLJul 21, 2025

Exact Reformulation and Optimization for Direct Metric Optimization in Binary Imbalanced Classification

Le Peng, Yash Travadi, Chuan He, Ying Cui, Ju Sun

arXiv:2507.15240v19.42 citationsh-index: 28Has Code

Originality Highly original

AI Analysis

This work addresses the challenge of handling varying class significance and prescribed metric levels in imbalanced classification, offering a novel framework for direct metric optimization.

The paper tackles the problem of directly optimizing classification metrics like precision, recall, and Fβ-score in binary imbalanced classification, introducing exact constrained reformulations that outperform state-of-the-art methods on benchmark datasets.

For classification with imbalanced class frequencies, i.e., imbalanced classification (IC), standard accuracy is known to be misleading as a performance measure. While most existing methods for IC resort to optimizing balanced accuracy (i.e., the average of class-wise recalls), they fall short in scenarios where the significance of classes varies or certain metrics should reach prescribed levels. In this paper, we study two key classification metrics, precision and recall, under three practical binary IC settings: fix precision optimize recall (FPOR), fix recall optimize precision (FROP), and optimize $F_β$-score (OFBS). Unlike existing methods that rely on smooth approximations to deal with the indicator function involved, \textit{we introduce, for the first time, exact constrained reformulations for these direct metric optimization (DMO) problems}, which can be effectively solved by exact penalty methods. Experiment results on multiple benchmark datasets demonstrate the practical superiority of our approach over the state-of-the-art methods for the three DMO problems. We also expect our exact reformulation and optimization (ERO) framework to be applicable to a wide range of DMO problems for binary IC and beyond. Our code is available at https://github.com/sun-umn/DMO.

View on arXiv PDF Code

Similar