CVAug 5, 2017

Optimizing Region Selection for Weakly Supervised Object Detection

arXiv:1708.01723v17 citations
Originality Incremental advance
AI Analysis

This work addresses weakly supervised object detection, which is important for reducing annotation costs in computer vision, but it appears incremental as it builds on existing region selection methods.

The paper tackled the problem of training object detectors with only image-level annotations by proposing an optimized region selection strategy that collects purified positive regions and discriminative negative regions, integrated into an end-to-end CNN framework, achieving improved performance on PASCAL VOC datasets.

Training object detectors with only image-level annotations is very challenging because the target objects are often surrounded by a large number of background clutters. Many existing approaches tackle this problem through object proposal mining. However, the collected positive regions are either low in precision or lack of diversity, and the strategy of collecting negative regions is not carefully designed, neither. Moreover, training is often slow because region selection and object detector training are processed separately. In this context, the primary contribution of this work is to improve weakly supervised detection with an optimized region selection strategy. The proposed method collects purified positive training regions by progressively removing easy background clutters, and selects discriminative negative regions by mining class-specific hard samples. This region selection procedure is further integrated into a CNN-based weakly supervised detection (WSD) framework, and can be performed in each stochastic gradient descent mini-batch during training. Therefore, the entire model can be trained end-to-end efficiently. Extensive evaluation results on PASCAL VOC 2007, VOC 2010 and VOC 2012 datasets are presented which demonstrate that the proposed method effectively improves WSD.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes