LGMLJul 21, 2025

Exact Reformulation and Optimization for Direct Metric Optimization in Binary Imbalanced Classification

arXiv:2507.15240v12 citationsh-index: 3Has Code
Originality Highly original
AI Analysis

This work addresses the challenge of handling varying class significance and prescribed metric levels in imbalanced classification, offering a novel framework for direct metric optimization.

The paper tackles the problem of directly optimizing classification metrics like precision, recall, and Fβ-score in binary imbalanced classification, introducing exact constrained reformulations that outperform state-of-the-art methods on benchmark datasets.

For classification with imbalanced class frequencies, i.e., imbalanced classification (IC), standard accuracy is known to be misleading as a performance measure. While most existing methods for IC resort to optimizing balanced accuracy (i.e., the average of class-wise recalls), they fall short in scenarios where the significance of classes varies or certain metrics should reach prescribed levels. In this paper, we study two key classification metrics, precision and recall, under three practical binary IC settings: fix precision optimize recall (FPOR), fix recall optimize precision (FROP), and optimize $F_β$-score (OFBS). Unlike existing methods that rely on smooth approximations to deal with the indicator function involved, \textit{we introduce, for the first time, exact constrained reformulations for these direct metric optimization (DMO) problems}, which can be effectively solved by exact penalty methods. Experiment results on multiple benchmark datasets demonstrate the practical superiority of our approach over the state-of-the-art methods for the three DMO problems. We also expect our exact reformulation and optimization (ERO) framework to be applicable to a wide range of DMO problems for binary IC and beyond. Our code is available at https://github.com/sun-umn/DMO.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes