CVApr 30, 2019

Segmentation is All You Need

Zehua Cheng, Yuxiang Wu, Zhenghua Xu, Thomas Lukasiewicz, Weiyang Wang

arXiv:1904.13300v34.720 citations

Originality Highly original

AI Analysis

This addresses the issue of unreliable object detection in challenging scenes for computer vision applications, representing a novel approach rather than an incremental improvement.

The paper tackles the problem of low recall in object detection under extreme cases by proposing WSMA-Seg, an anchor-free and NMS-free model that uses segmentation to achieve accurate detection, outperforming state-of-the-art detectors on multiple datasets.

Region proposal mechanisms are essential for existing deep learning approaches to object detection in images. Although they can generally achieve a good detection performance under normal circumstances, their recall in a scene with extreme cases is unacceptably low. This is mainly because bounding box annotations contain much environment noise information, and non-maximum suppression (NMS) is required to select target boxes. Therefore, in this paper, we propose the first anchor-free and NMS-free object detection model called weakly supervised multimodal annotation segmentation (WSMA-Seg), which utilizes segmentation models to achieve an accurate and robust object detection without NMS. In WSMA-Seg, multimodal annotations are proposed to achieve an instance-aware segmentation using weakly supervised bounding boxes; we also develop a run-data-based following algorithm to trace contours of objects. In addition, we propose a multi-scale pooling segmentation (MSP-Seg) as the underlying segmentation model of WSMA-Seg to achieve a more accurate segmentation and to enhance the detection accuracy of WSMA-Seg. Experimental results on multiple datasets show that the proposed WSMA-Seg approach outperforms the state-of-the-art detectors.

View on arXiv PDF

Similar