CVAILGAug 23, 2023

Masking Strategies for Background Bias Removal in Computer Vision Models

arXiv:2308.12127v113 citationsh-index: 30Has Code
Originality Incremental advance
AI Analysis

This addresses bias removal for computer vision models in fine-grained classification tasks, which is an incremental improvement over existing methods.

The research tackled background-induced bias in fine-grained image classification by evaluating early and late masking strategies on CNN and ViT models, finding that both improved out-of-distribution performance, with early masking on a ViT variant achieving the highest robustness.

Models for fine-grained image classification tasks, where the difference between some classes can be extremely subtle and the number of samples per class tends to be low, are particularly prone to picking up background-related biases and demand robust methods to handle potential examples with out-of-distribution (OOD) backgrounds. To gain deeper insights into this critical problem, our research investigates the impact of background-induced bias on fine-grained image classification, evaluating standard backbone models such as Convolutional Neural Network (CNN) and Vision Transformers (ViT). We explore two masking strategies to mitigate background-induced bias: Early masking, which removes background information at the (input) image level, and late masking, which selectively masks high-level spatial features corresponding to the background. Extensive experiments assess the behavior of CNN and ViT models under different masking strategies, with a focus on their generalization to OOD backgrounds. The obtained findings demonstrate that both proposed strategies enhance OOD performance compared to the baseline models, with early masking consistently exhibiting the best OOD performance. Notably, a ViT variant employing GAP-Pooled Patch token-based classification combined with early masking achieves the highest OOD robustness.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes