LG CO MLNov 4, 2020

Stochastic Hard Thresholding Algorithms for AUC Maximization

Zhenhuan Yang, Baojian Zhou, Yunwen Lei, Yiming Ying

arXiv:2011.02396v13.33 citationsHas Code

Originality Highly original

AI Analysis

This addresses AUC optimization in imbalanced classification, an incremental improvement with a novel method for a known bottleneck.

The paper tackled AUC maximization for imbalanced classification by developing a stochastic hard thresholding algorithm (SHT-AUC) with a per-iteration cost of O(b d), achieving linear convergence up to a tolerance error, with experiments showing efficiency and effectiveness.

In this paper, we aim to develop stochastic hard thresholding algorithms for the important problem of AUC maximization in imbalanced classification. The main challenge is the pairwise loss involved in AUC maximization. We overcome this obstacle by reformulating the U-statistics objective function as an empirical risk minimization (ERM), from which a stochastic hard thresholding algorithm (\texttt{SHT-AUC}) is developed. To our best knowledge, this is the first attempt to provide stochastic hard thresholding algorithms for AUC maximization with a per-iteration cost $Ø(b d)$ where $d$ and $b$ are the dimension of the data and the minibatch size, respectively. We show that the proposed algorithm enjoys the linear convergence rate up to a tolerance error. In particular, we show, if the data is generated from the Gaussian distribution, then its convergence becomes slower as the data gets more imbalanced. We conduct extensive experiments to show the efficiency and effectiveness of the proposed algorithms.

View on arXiv PDF Code

Similar