CVAILGJul 22, 2024

Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond

arXiv:2407.15739v16 citationsh-index: 6
Originality Incremental advance
AI Analysis

This addresses the problem of detecting unknown objects in semantic segmentation for applications like autonomous driving and computer vision, but it is incremental as it adapts existing diffusion methods to a new domain.

The paper tackles out-of-distribution detection for semantic segmentation by extending it from road scenes to general natural images, introducing the ADE-OoD benchmark and a diffusion-based method (DOoD) that performs on par or better than state-of-the-art on road scenes and outperforms previous approaches on ADE-OoD.

In recent years, research on out-of-distribution (OoD) detection for semantic segmentation has mainly focused on road scenes -- a domain with a constrained amount of semantic diversity. In this work, we challenge this constraint and extend the domain of this task to general natural images. To this end, we introduce: 1. the ADE-OoD benchmark, which is based on the ADE20k dataset and includes images from diverse domains with a high semantic diversity, and 2. a novel approach that uses Diffusion score matching for OoD detection (DOoD) and is robust to the increased semantic diversity. ADE-OoD features indoor and outdoor images, defines 150 semantic categories as in-distribution, and contains a variety of OoD objects. For DOoD, we train a diffusion model with an MLP architecture on semantic in-distribution embeddings and build on the score matching interpretation to compute pixel-wise OoD scores at inference time. On common road scene OoD benchmarks, DOoD performs on par or better than the state of the art, without using outliers for training or making assumptions about the data domain. On ADE-OoD, DOoD outperforms previous approaches, but leaves much room for future improvements.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes