CVMay 26, 2021

Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey

Feifei Shao, Long Chen, Jian Shao, Wei Ji, Shaoning Xiao, Lu Ye, Yueting Zhuang, Jun Xiao

arXiv:2105.12694v16.520 citations

Originality Synthesis-oriented

AI Analysis

It addresses the problem of object detection with limited supervision for computer vision researchers, but it is incremental as a survey paper.

This paper provides a comprehensive survey of weakly-supervised object detection (WSOD) and localization (WSOL), summarizing recent achievements, techniques, datasets, and future directions in the field.

Weakly-Supervised Object Detection (WSOD) and Localization (WSOL), i.e., detecting multiple and single instances with bounding boxes in an image using image-level labels, are long-standing and challenging tasks in the CV community. With the success of deep neural networks in object detection, both WSOD and WSOL have received unprecedented attention. Hundreds of WSOD and WSOL methods and numerous techniques have been proposed in the deep learning era. To this end, in this paper, we consider WSOL is a sub-task of WSOD and provide a comprehensive survey of the recent achievements of WSOD. Specifically, we firstly describe the formulation and setting of the WSOD, including the background, challenges, basic framework. Meanwhile, we summarize and analyze all advanced techniques and training tricks for improving detection performance. Then, we introduce the widely-used datasets and evaluation metrics of WSOD. Lastly, we discuss the future directions of WSOD. We believe that these summaries can help pave a way for future research on WSOD and WSOL.

View on arXiv PDF

Similar