MELGMLMay 7, 2019

F-measure Maximizing Logistic Regression

arXiv:1905.02535v11.24 citations
Originality Incremental advance
AI Analysis

This addresses classification issues in imbalanced datasets, but it is incremental as it builds on existing F-measure optimization techniques.

The paper tackles the problem of logistic regression performing poorly on imbalanced data by proposing an F-measure optimization method, showing it improves performance in experiments with real-world data.

Logistic regression is a widely used method in several fields. When applying logistic regression to imbalanced data, for which majority classes dominate over minority classes, all class labels are estimated as `majority class.' In this article, we use an F-measure optimization method to improve the performance of logistic regression applied to imbalanced data. While many F-measure optimization methods adopt a ratio of the estimators to approximate the F-measure, the ratio of the estimators tends to have more bias than when the ratio is directly approximated. Therefore, we employ an approximate F-measure for estimating the relative density ratio. In addition, we define a relative F-measure and approximate the relative F-measure. We show an algorithm for a logistic regression weighted approximated relative to the F-measure. The experimental results using real world data demonstrated that our proposed method is an efficient algorithm to improve the performance of logistic regression applied to imbalanced data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes