CVLGOct 31, 2023

Long-Tailed Learning as Multi-Objective Optimization

arXiv:2310.20490v210 citationsh-index: 9
Originality Highly original
AI Analysis

This addresses model bias in imbalanced real-world data for machine learning applications, representing an incremental improvement with a novel method for a known bottleneck.

The paper tackles the seesaw dilemma in long-tailed learning, where improving performance on tail classes often harms head classes, by formulating it as a multi-objective optimization problem and proposing a Gradient-Balancing Grouping (GBG) strategy, achieving state-of-the-art results on benchmarks.

Real-world data is extremely imbalanced and presents a long-tailed distribution, resulting in models that are biased towards classes with sufficient samples and perform poorly on rare classes. Recent methods propose to rebalance classes but they undertake the seesaw dilemma (what is increasing performance on tail classes may decrease that of head classes, and vice versa). In this paper, we argue that the seesaw dilemma is derived from gradient imbalance of different classes, in which gradients of inappropriate classes are set to important for updating, thus are prone to overcompensation or undercompensation on tail classes. To achieve ideal compensation, we formulate the long-tailed recognition as an multi-objective optimization problem, which fairly respects the contributions of head and tail classes simultaneously. For efficiency, we propose a Gradient-Balancing Grouping (GBG) strategy to gather the classes with similar gradient directions, thus approximately make every update under a Pareto descent direction. Our GBG method drives classes with similar gradient directions to form more representative gradient and provide ideal compensation to the tail classes. Moreover, We conduct extensive experiments on commonly used benchmarks in long-tailed learning and demonstrate the superiority of our method over existing SOTA methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes