LGMLMay 23, 2019

Randomized Reference Classifier with Gaussian Distribution and Soft Confusion Matrix Applied to the Improving Weak Classifiers

arXiv:1905.09820v11 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for machine learning practitioners working with weak classifiers and imbalanced data.

The paper tackles the problem of building Randomized Reference Classifier models using probability distributions other than beta distribution by proposing a truncated normal distribution approach, with results showing it is comparable to beta-based models and better at discovering minority class objects for some classifiers.

In this paper, an issue of building the RRC model using probability distributions other than beta distribution is addressed. More precisely, in this paper, we propose to build the RRR model using the truncated normal distribution. Heuristic procedures for expected value and the variance of the truncated-normal distribution are also proposed. The proposed approach is tested using SCM-based model for testing the consequences of applying the truncated normal distribution in the RRC model. The experimental evaluation is performed using four different base classifiers and seven quality measures. The results showed that the proposed approach is comparable to the RRC model built using beta distribution. What is more, for some base classifiers, the truncated-normal-based SCM algorithm turned out to be better at discovering objects coming from minority classes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes