LGMLJun 21, 2020

On Aggregation in Ensembles of Multilabel Classifiers

arXiv:2006.11916v18 citations
Originality Incremental advance
AI Analysis

This addresses a gap in ensemble methods for multilabel classification, offering tailored solutions for specific loss functions, but it is incremental as it builds on existing ensemble techniques.

The paper tackles the problem of aggregating predictions in ensembles of multilabel classifiers by introducing a formal framework with two approaches, PTC and CTP, which outperform standard voting techniques in experiments.

While a variety of ensemble methods for multilabel classification have been proposed in the literature, the question of how to aggregate the predictions of the individual members of the ensemble has received little attention so far. In this paper, we introduce a formal framework of ensemble multilabel classification, in which we distinguish two principal approaches: "predict then combine" (PTC), where the ensemble members first make loss minimizing predictions which are subsequently combined, and "combine then predict" (CTP), which first aggregates information such as marginal label probabilities from the individual ensemble members, and then derives a prediction from this aggregation. While both approaches generalize voting techniques commonly used for multilabel ensembles, they allow to explicitly take the target performance measure into account. Therefore, concrete instantiations of CTP and PTC can be tailored to concrete loss functions. Experimentally, we show that standard voting techniques are indeed outperformed by suitable instantiations of CTP and PTC, and provide some evidence that CTP performs well for decomposable loss functions, whereas PTC is the better choice for non-decomposable losses.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes