MLSTCOJan 9, 2017

On Reject and Refine Options in Multicategory Classification

arXiv:1701.02265v123 citations
Originality Incremental advance
AI Analysis

This work addresses the need for safer and more informative classification in high-stakes applications like medical diagnosis, though it is incremental as it extends existing binary reject methods to multicategory settings.

The authors tackled the problem of multicategory classification with a reject option, which defers decisions on error-prone cases, and introduced a novel refine option that predicts a set of possible class labels, showing fast convergence rates in theory and improved performance in simulations and real data studies.

In many real applications of statistical learning, a decision made from misclassification can be too costly to afford; in this case, a reject option, which defers the decision until further investigation is conducted, is often preferred. In recent years, there has been much development for binary classification with a reject option. Yet, little progress has been made for the multicategory case. In this article, we propose margin-based multicategory classification methods with a reject option. In addition, and more importantly, we introduce a new and unique refine option for the multicategory problem, where the class of an observation is predicted to be from a set of class labels, whose cardinality is not necessarily one. The main advantage of both options lies in their capacity of identifying error-prone observations. Moreover, the refine option can provide more constructive information for classification by effectively ruling out implausible classes. Efficient implementations have been developed for the proposed methods. On the theoretical side, we offer a novel statistical learning theory and show a fast convergence rate of the excess $\ell$-risk of our methods with emphasis on diverging dimensionality and number of classes. The results can be further improved under a low noise assumption. A set of comprehensive simulation and real data studies has shown the usefulness of the new learning tools compared to regular multicategory classifiers. Detailed proofs of theorems and extended numerical results are included in the supplemental materials available online.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes