LG ML Jul 5, 2020

Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels

Yu-Ting Chou, Gang Niu, Hsuan-Tien Lin, Masashi Sugiyama

arXiv:2007.02235v3 15.369 citations

Originality: Incremental advance

AI Analysis

The paper addresses overfitting in weakly supervised learning, which is relevant to both researchers and practitioners, but the contribution is incremental because it builds on existing URE methods.

The paper tackles overfitting in weakly supervised learning with unbiased risk estimators (UREs), using learning with complementary labels as a case study. It shows that UREs yield unbiased gradient estimators (UGEs), but that these estimators suffer from high variance. It then proposes a surrogate complementary loss (SCL) framework that reduces variance, mitigates overfitting, and improves on URE-based methods.

In weakly supervised learning, the unbiased risk estimator (URE) is a powerful tool for training classifiers when training and test data are drawn from different distributions. Nevertheless, UREs lead to overfitting in many problem settings when the models are complex, such as deep networks. In this paper, we investigate the reasons for such overfitting by studying a weakly supervised problem called learning with complementary labels. We argue that the quality of gradient estimation matters more in risk minimization. Theoretically, we show that a URE gives an unbiased gradient estimator (UGE). Practically, however, UGEs may suffer from huge variance, which causes empirical gradients to usually be far from the true gradients during minimization. To this end, we propose a novel surrogate complementary loss (SCL) framework that trades zero bias for reduced variance and makes empirical gradients more aligned in direction with the true gradients. Thanks to this characteristic, SCL successfully mitigates the overfitting issue and improves on URE-based methods.
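
The unbiasedness and variance argument can be illustrated numerically. The following is a minimal sketch, not the authors' implementation. It assumes complementary labels are drawn uniformly from the K - 1 incorrect classes and uses cross-entropy as the base loss; under that assumption, a commonly used URE rewrites the ordinary loss as the sum of losses over all classes minus (K - 1) times the loss on the complementary label. The function names `cross_entropy` and `ure_loss` and the probability vector are hypothetical, chosen only for illustration.

```python
import numpy as np

def cross_entropy(probs, label):
    """Ordinary cross-entropy loss for one predicted probability vector."""
    return -np.log(probs[label])

def ure_loss(probs, comp_label):
    """Per-sample URE value under the uniform complementary-label assumption:
    sum_k loss(k) - (K - 1) * loss(comp_label)."""
    K = len(probs)
    total = sum(cross_entropy(probs, k) for k in range(K))
    return total - (K - 1) * cross_entropy(probs, comp_label)

# Hypothetical 4-class prediction; class 0 is the (unobserved) true label.
K = 4
true_label = 0
probs = np.array([0.4, 0.3, 0.2, 0.1])

# Every complementary label this sample could receive (any class but the true one).
comp_labels = [k for k in range(K) if k != true_label]
ure_values = np.array([ure_loss(probs, c) for c in comp_labels])

ordinary = cross_entropy(probs, true_label)  # loss we would use with the true label
ure_mean = ure_values.mean()                 # expectation over uniform complementary labels
ure_var = ure_values.var()                   # spread of the single-sample estimates

print("ordinary loss:", ordinary)
print("URE values per complementary label:", ure_values)
print("URE mean:", ure_mean, "variance:", ure_var)
```

In this sketch the mean of the URE values equals the ordinary loss, which is the unbiasedness property, while individual values differ widely and one of them is negative. The same spread carries over to gradients of these per-sample terms, which is consistent with the abstract's point that an unbiased estimator can still be far from the true quantity on any given sample.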
