LGAIOct 17, 2020

MESA: Boost Ensemble Imbalanced Learning with MEta-SAmpler

arXiv:2010.08830v176 citationsHas Code
Originality Highly original
AI Analysis

This work addresses the challenge of learning unbiased models from class-imbalanced data, which is a common issue in machine learning applications, by proposing a more stable and efficient method compared to existing heuristic-based approaches.

The paper tackles the problem of imbalanced learning by introducing MESA, a novel ensemble framework that adaptively resamples training data to optimize final metrics, achieving improved performance and transferability across synthetic and real-world tasks.

Imbalanced learning (IL), i.e., learning unbiased models from class-imbalanced data, is a challenging problem. Typical IL methods including resampling and reweighting were designed based on some heuristic assumptions. They often suffer from unstable performance, poor applicability, and high computational cost in complex tasks where their assumptions do not hold. In this paper, we introduce a novel ensemble IL framework named MESA. It adaptively resamples the training set in iterations to get multiple classifiers and forms a cascade ensemble model. MESA directly learns the sampling strategy from data to optimize the final metric beyond following random heuristics. Moreover, unlike prevailing meta-learning-based IL solutions, we decouple the model-training and meta-training in MESA by independently train the meta-sampler over task-agnostic meta-data. This makes MESA generally applicable to most of the existing learning models and the meta-sampler can be efficiently applied to new tasks. Extensive experiments on both synthetic and real-world tasks demonstrate the effectiveness, robustness, and transferability of MESA. Our code is available at https://github.com/ZhiningLiu1998/mesa.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes