CVJan 10, 2024

Wasserstein Distance-based Expansion of Low-Density Latent Regions for Unknown Class Detection

arXiv:2401.05594v33 citationsh-index: 29
Originality Incremental advance
AI Analysis

This addresses the challenge of reliably detecting unknown objects in real-world applications like autonomous driving, though it builds incrementally on the existing Open-Det framework.

The paper tackles the problem of open-set object detection where detectors incorrectly classify unknown objects as known categories, by introducing a novel approach that expands low-density latent regions using Wasserstein distance, resulting in a 17%-22% reduction in open-set errors and up to 16% improvement in novelty detection.

This paper addresses the significant challenge in open-set object detection (OSOD): the tendency of state-of-the-art detectors to erroneously classify unknown objects as known categories with high confidence. We present a novel approach that effectively identifies unknown objects by distinguishing between high and low-density regions in latent space. Our method builds upon the Open-Det (OD) framework, introducing two new elements to the loss function. These elements enhance the known embedding space's clustering and expand the unknown space's low-density regions. The first addition is the Class Wasserstein Anchor (CWA), a new function that refines the classification boundaries. The second is a spectral normalisation step, improving the robustness of the model. Together, these augmentations to the existing Contrastive Feature Learner (CFL) and Unknown Probability Learner (UPL) loss functions significantly improve OSOD performance. Our proposed OpenDet-CWA (OD-CWA) method demonstrates: a) a reduction in open-set errors by approximately 17%-22%, b) an enhancement in novelty detection capability by 1.5%-16%, and c) a decrease in the wilderness index by 2%-20% across various open-set scenarios. These results represent a substantial advancement in the field, showcasing the potential of our approach in managing the complexities of open-set object detection.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes