CVMay 26, 2021

Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey

arXiv:2105.12694v120 citations
Originality Synthesis-oriented
AI Analysis

It addresses the problem of object detection with limited supervision for computer vision researchers, but it is incremental as a survey paper.

This paper provides a comprehensive survey of weakly-supervised object detection (WSOD) and localization (WSOL), summarizing recent achievements, techniques, datasets, and future directions in the field.

Weakly-Supervised Object Detection (WSOD) and Localization (WSOL), i.e., detecting multiple and single instances with bounding boxes in an image using image-level labels, are long-standing and challenging tasks in the CV community. With the success of deep neural networks in object detection, both WSOD and WSOL have received unprecedented attention. Hundreds of WSOD and WSOL methods and numerous techniques have been proposed in the deep learning era. To this end, in this paper, we consider WSOL is a sub-task of WSOD and provide a comprehensive survey of the recent achievements of WSOD. Specifically, we firstly describe the formulation and setting of the WSOD, including the background, challenges, basic framework. Meanwhile, we summarize and analyze all advanced techniques and training tricks for improving detection performance. Then, we introduce the widely-used datasets and evaluation metrics of WSOD. Lastly, we discuss the future directions of WSOD. We believe that these summaries can help pave a way for future research on WSOD and WSOL.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes