CVAILGJul 6, 2024

CBM: Curriculum by Masking

arXiv:2407.05193v25 citationsh-index: 8Has Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of optimizing training for computer vision tasks, offering a versatile method that enhances performance in object recognition and detection, though it appears incremental as it builds on existing curriculum learning approaches.

The paper tackles the problem of improving training efficiency and accuracy in object recognition and detection by proposing Curriculum by Masking (CBM), a curriculum learning strategy that uses patch masking to create an easy-to-hard training schedule, resulting in significant accuracy improvements over conventional and previous curriculum learning methods across multiple datasets and architectures.

We propose Curriculum by Masking (CBM), a novel state-of-the-art curriculum learning strategy that effectively creates an easy-to-hard training schedule via patch (token) masking, offering significant accuracy improvements over the conventional training regime and previous curriculum learning (CL) methods. CBM leverages gradient magnitudes to prioritize the masking of salient image regions via a novel masking algorithm and a novel masking block. Our approach enables controlling sample difficulty via the patch masking ratio, generating an effective easy-to-hard curriculum by gradually introducing harder samples as training progresses. CBM operates with two easily configurable parameters, i.e. the number of patches and the curriculum schedule, making it a versatile curriculum learning approach for object recognition and detection. We conduct experiments with various neural architectures, ranging from convolutional networks to vision transformers, on five benchmark data sets (CIFAR-10, CIFAR-100, ImageNet, Food-101 and PASCAL VOC), to compare CBM with conventional as well as curriculum-based training regimes. Our results reveal the superiority of our strategy compared with the state-of-the-art curriculum learning regimes. We also observe improvements in transfer learning contexts, where CBM surpasses previous work by considerable margins in terms of accuracy. We release our code for free non-commercial use at https://github.com/CroitoruAlin/CBM.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes