LGMLApr 13, 2020

Diverse Instances-Weighting Ensemble based on Region Drift Disagreement for Concept Drift Adaptation

arXiv:2004.05810v163 citations
Originality Incremental advance
AI Analysis

This addresses concept drift adaptation for data stream classification, with incremental improvements in ensemble diversity.

The paper tackled the challenge of maintaining ensemble diversity in data stream classification under concept drift by proposing a diversity measurement based on regional distribution change agreement, resulting in the DiwE algorithm that showed effectiveness on synthetic and real-world benchmarks.

Concept drift refers to changes in the distribution of underlying data and is an inherent property of evolving data streams. Ensemble learning, with dynamic classifiers, has proved to be an efficient method of handling concept drift. However, the best way to create and maintain ensemble diversity with evolving streams is still a challenging problem. In contrast to estimating diversity via inputs, outputs, or classifier parameters, we propose a diversity measurement based on whether the ensemble members agree on the probability of a regional distribution change. In our method, estimations over regional distribution changes are used as instance weights. Constructing different region sets through different schemes will lead to different drift estimation results, thereby creating diversity. The classifiers that disagree the most are selected to maximize diversity. Accordingly, an instance-based ensemble learning algorithm, called the diverse instance weighting ensemble (DiwE), is developed to address concept drift for data stream classification problems. Evaluations of various synthetic and real-world data stream benchmarks show the effectiveness and advantages of the proposed algorithm.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes