LGFeb 22, 2023

Delving into Identify-Emphasize Paradigm for Combating Unknown Bias

Bowen Zhao, Chen Chen, Qian-Wei Wang, Anfeng He, Shu-Tao Xia

arXiv:2302.11414v12.01 citationsh-index: 47

Originality Incremental advance

AI Analysis

This work addresses unknown biases in machine learning models, which is an incremental improvement for enhancing generalization in biased datasets.

The paper tackled the problem of dataset biases harming model robustness by improving the identify-emphasize paradigm, proposing methods like bias-conflicting scoring and gradient alignment to enhance identification and optimization, achieving state-of-the-art performance on multiple datasets.

Dataset biases are notoriously detrimental to model robustness and generalization. The identify-emphasize paradigm appears to be effective in dealing with unknown biases. However, we discover that it is still plagued by two challenges: A, the quality of the identified bias-conflicting samples is far from satisfactory; B, the emphasizing strategies only produce suboptimal performance. In this paper, for challenge A, we propose an effective bias-conflicting scoring method (ECS) to boost the identification accuracy, along with two practical strategies -- peer-picking and epoch-ensemble. For challenge B, we point out that the gradient contribution statistics can be a reliable indicator to inspect whether the optimization is dominated by bias-aligned samples. Then, we propose gradient alignment (GA), which employs gradient statistics to balance the contributions of the mined bias-aligned and bias-conflicting samples dynamically throughout the learning process, forcing models to leverage intrinsic features to make fair decisions. Furthermore, we incorporate self-supervised (SS) pretext tasks into training, which enable models to exploit richer features rather than the simple shortcuts, resulting in more robust models. Experiments are conducted on multiple datasets in various settings, demonstrating that the proposed solution can mitigate the impact of unknown biases and achieve state-of-the-art performance.

View on arXiv PDF

Similar