CVApr 9, 2019

Prime Sample Attention in Object Detection

arXiv:1904.04821v2226 citations
Originality Incremental advance
AI Analysis

This work addresses a key bottleneck in object detection by improving training efficiency for better accuracy, though it is incremental as it builds on existing detection frameworks.

The paper tackles the problem of treating all samples equally in object detection by identifying 'Prime Samples' that drive performance, and proposes PISA to focus training on these samples, achieving around 2% mAP improvement on MSCOCO over baselines like OHEM and Focal Loss.

It is a common paradigm in object detection frameworks to treat all samples equally and target at maximizing the performance on average. In this work, we revisit this paradigm through a careful study on how different samples contribute to the overall performance measured in terms of mAP. Our study suggests that the samples in each mini-batch are neither independent nor equally important, and therefore a better classifier on average does not necessarily mean higher mAP. Motivated by this study, we propose the notion of Prime Samples, those that play a key role in driving the detection performance. We further develop a simple yet effective sampling and learning strategy called PrIme Sample Attention (PISA) that directs the focus of the training process towards such samples. Our experiments demonstrate that it is often more effective to focus on prime samples than hard samples when training a detector. Particularly, On the MSCOCO dataset, PISA outperforms the random sampling baseline and hard mining schemes, e.g., OHEM and Focal Loss, consistently by around 2% on both single-stage and two-stage detectors, even with a strong backbone ResNeXt-101.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes