CVAug 23, 2021

Towards Balanced Learning for Instance Recognition

arXiv:2108.10175v124 citations
Originality Incremental advance
AI Analysis

This work addresses training inefficiencies in object detection, which is crucial for computer vision applications, but it is incremental as it builds on existing R-CNN frameworks.

The paper tackles the problem of training imbalance in instance recognition detectors, identifying three levels of imbalance (sample, feature, objective) and proposing Libra R-CNN to address them, resulting in improved performance validated on MS COCO, LVIS, and Pascal VOC datasets.

Instance recognition is rapidly advanced along with the developments of various deep convolutional neural networks. Compared to the architectures of networks, the training process, which is also crucial to the success of detectors, has received relatively less attention. In this work, we carefully revisit the standard training practice of detectors, and find that the detection performance is often limited by the imbalance during the training process, which generally consists in three levels - sample level, feature level, and objective level. To mitigate the adverse effects caused thereby, we propose Libra R-CNN, a simple yet effective framework towards balanced learning for instance recognition. It integrates IoU-balanced sampling, balanced feature pyramid, and objective re-weighting, respectively for reducing the imbalance at sample, feature, and objective level. Extensive experiments conducted on MS COCO, LVIS and Pascal VOC datasets prove the effectiveness of the overall balanced design.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes