CVAug 15, 2021

Learning Open-World Object Proposals without Learning to Classify

arXiv:2108.06753v1160 citations
Originality Incremental advance
AI Analysis

This work addresses a key bottleneck in object proposal systems for computer vision, enabling better generalization to unseen objects, though it is incremental in refining existing paradigms.

The paper tackles the problem of object proposal methods overfitting to training categories, which limits their ability to generalize to novel objects in open-world scenarios. The result is a classification-free approach that outperforms existing methods on cross-category and cross-dataset evaluations, with clear improvements in rare and common categories on the LVIS dataset.

Object proposals have become an integral preprocessing steps of many vision pipelines including object detection, weakly supervised detection, object discovery, tracking, etc. Compared to the learning-free methods, learning-based proposals have become popular recently due to the growing interest in object detection. The common paradigm is to learn object proposals from data labeled with a set of object regions and their corresponding categories. However, this approach often struggles with novel objects in the open world that are absent in the training set. In this paper, we identify that the problem is that the binary classifiers in existing proposal methods tend to overfit to the training categories. Therefore, we propose a classification-free Object Localization Network (OLN) which estimates the objectness of each region purely by how well the location and shape of a region overlap with any ground-truth object (e.g., centerness and IoU). This simple strategy learns generalizable objectness and outperforms existing proposals on cross-category generalization on COCO, as well as cross-dataset evaluation on RoboNet, Object365, and EpicKitchens. Finally, we demonstrate the merit of OLN for long-tail object detection on large vocabulary dataset, LVIS, where we notice clear improvement in rare and common categories.

Code Implementations6 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes