CVAug 31, 2023

Unsupervised Recognition of Unknown Objects for Open-World Object Detection

arXiv:2308.16527v115 citationsh-index: 38
Originality Incremental advance
AI Analysis

This addresses the challenge of detecting unknown objects in dynamic real-world scenarios for computer vision applications, representing an incremental advance over prior methods.

The paper tackles the label bias problem in Open-World Object Detection, where models misclassify unknown objects as background, by proposing an unsupervised discriminative model and classification-free self-training to recognize unknown objects from raw pseudo labels, achieving significant improvements in unknown object detection on MS COCO while maintaining competitive known object performance and better generalization on LVIS and Objects365.

Open-World Object Detection (OWOD) extends object detection problem to a realistic and dynamic scenario, where a detection model is required to be capable of detecting both known and unknown objects and incrementally learning newly introduced knowledge. Current OWOD models, such as ORE and OW-DETR, focus on pseudo-labeling regions with high objectness scores as unknowns, whose performance relies heavily on the supervision of known objects. While they can detect the unknowns that exhibit similar features to the known objects, they suffer from a severe label bias problem that they tend to detect all regions (including unknown object regions) that are dissimilar to the known objects as part of the background. To eliminate the label bias, this paper proposes a novel approach that learns an unsupervised discriminative model to recognize true unknown objects from raw pseudo labels generated by unsupervised region proposal methods. The resulting model can be further refined by a classification-free self-training method which iteratively extends pseudo unknown objects to the unlabeled regions. Experimental results show that our method 1) significantly outperforms the prior SOTA in detecting unknown objects while maintaining competitive performance of detecting known object classes on the MS COCO dataset, and 2) achieves better generalization ability on the LVIS and Objects365 datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes