CVLGIVJul 30, 2019

Propose-and-Attend Single Shot Detector

arXiv:1907.12736v19 citations
Originality Incremental advance
AI Analysis

This work addresses efficiency and accuracy challenges in object detection for computer vision applications, presenting an incremental improvement over existing one-stage methods.

The authors tackled the problem of improving one-stage object detectors by introducing a propose-and-attend module that adjusts default boxes and aligns receptive fields, achieving accuracy comparable to state-of-the-art detectors with reduced parameters and computational overhead.

We present a simple yet effective prediction module for a one-stage detector. The main process is conducted in a coarse-to-fine manner. First, the module roughly adjusts the default boxes to well capture the extent of target objects in an image. Second, given the adjusted boxes, the module aligns the receptive field of the convolution filters accordingly, not requiring any embedding layers. Both steps build a propose-and-attend mechanism, mimicking two-stage detectors in a highly efficient manner. To verify its effectiveness, we apply the proposed module to a basic one-stage detector SSD. Our final model achieves an accuracy comparable to that of state-of-the-art detectors while using a fraction of their model parameters and computational overheads. Moreover, we found that the proposed module has two strong applications. 1) The module can be successfully integrated into a lightweight backbone, further pushing the efficiency of the one-stage detector. 2) The module also allows train-from-scratch without relying on any sophisticated base networks as previous methods do.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes