MLLGNov 27, 2017

Learning with Biased Complementary Labels

arXiv:1711.09535v3221 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of noisy label annotation in machine learning, particularly for scenarios where true labels are hard to obtain, by improving classification accuracy with biased complementary labels, though it is incremental as it builds on existing complementary label methods.

The paper tackles the classification problem using biased complementary labels, which are surrogate labels indicating classes an observation does not belong to, by modeling annotation biases and proposing a framework to estimate unbiased transition probabilities and modify loss functions, achieving superior performance on benchmark datasets compared to state-of-the-art methods.

In this paper, we study the classification problem in which we have access to easily obtainable surrogate for true labels, namely complementary labels, which specify classes that observations do \textbf{not} belong to. Let $Y$ and $\bar{Y}$ be the true and complementary labels, respectively. We first model the annotation of complementary labels via transition probabilities $P(\bar{Y}=i|Y=j), i\neq j\in\{1,\cdots,c\}$, where $c$ is the number of classes. Previous methods implicitly assume that $P(\bar{Y}=i|Y=j), \forall i\neq j$, are identical, which is not true in practice because humans are biased toward their own experience. For example, as shown in Figure 1, if an annotator is more familiar with monkeys than prairie dogs when providing complementary labels for meerkats, she is more likely to employ "monkey" as a complementary label. We therefore reason that the transition probabilities will be different. In this paper, we propose a framework that contributes three main innovations to learning with \textbf{biased} complementary labels: (1) It estimates transition probabilities with no bias. (2) It provides a general method to modify traditional loss functions and extends standard deep neural network classifiers to learn with biased complementary labels. (3) It theoretically ensures that the classifier learned with complementary labels converges to the optimal one learned with true labels. Comprehensive experiments on several benchmark datasets validate the superiority of our method to current state-of-the-art methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes