CVLGROIVJun 15, 2020

Visibility Guided NMS: Efficient Boosting of Amodal Object Detection in Crowded Traffic Scenes

arXiv:2006.08547v121 citations
Originality Incremental advance
AI Analysis

This addresses a critical issue for autonomous driving systems in handling occluded objects, though it is an incremental improvement over existing NMS methods.

The paper tackles the problem of standard Non-Maximum-Suppression (NMS) failing in crowded traffic scenes with high occlusion, such as parked cars or pedestrian crowds, by proposing Visibility Guided NMS (vg-NMS), which improves detection performance for highly occluded objects with little computational overhead and outperforms current state-of-the-art NMS on datasets like KITTI, VIPER, and Synscapes.

Object detection is an important task in environment perception for autonomous driving. Modern 2D object detection frameworks such as Yolo, SSD or Faster R-CNN predict multiple bounding boxes per object that are refined using Non-Maximum-Suppression (NMS) to suppress all but one bounding box. While object detection itself is fully end-to-end learnable and does not require any manual parameter selection, standard NMS is parametrized by an overlap threshold that has to be chosen by hand. In practice, this often leads to an inability of standard NMS strategies to distinguish different objects in crowded scenes in the presence of high mutual occlusion, e.g. for parked cars or crowds of pedestrians. Our novel Visibility Guided NMS (vg-NMS) leverages both pixel-based as well as amodal object detection paradigms and improves the detection performance especially for highly occluded objects with little computational overhead. We evaluate vg-NMS using KITTI, VIPER as well as the Synscapes dataset and show that it outperforms current state-of-the-art NMS.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes