LGJul 19, 2017

Naive Bayes Classification for Subset Selection

arXiv:1707.06142v11 citations
Originality Synthesis-oriented
AI Analysis

This addresses subset selection in multi-label classification, but it is incremental as it adapts an existing method to a new domain.

The paper tackles the problem of automatically selecting a subset of items from a larger set, using an extension of Naive Bayes classification for multi-label classification, and evaluates it on real-world problems.

This article focuses on the question of learning how to automatically select a subset of items among a bigger set. We introduce a methodology for the inference of ensembles of discrete values, based on the Naive Bayes assumption. Our motivation stems from practical use cases where one wishes to predict an unordered set of (possibly interdependent) values from a set of observed features. This problem can be considered in the context of Multi-label Classification (MLC) where such values are seen as labels associated to continuous or discrete features. We introduce the \nbx algorithm, an extension of Naive Bayes classification into the multi-label domain, discuss its properties and evaluate our approach on real-world problems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes