CVJan 15

ICONIC-444: A 3.1-Million-Image Dataset for OOD Detection Research

arXiv:2601.10802v1h-index: 4
Originality Synthesis-oriented
AI Analysis

This dataset addresses a bottleneck for researchers in OOD detection by offering structured, diverse data for rigorous evaluation across varying difficulty levels.

The authors tackled the lack of large, high-quality datasets for out-of-distribution (OOD) detection by introducing ICONIC-444, a dataset with over 3.1 million images across 444 classes, and provided baseline results for 22 state-of-the-art methods.

Current progress in out-of-distribution (OOD) detection is limited by the lack of large, high-quality datasets with clearly defined OOD categories across varying difficulty levels (near- to far-OOD) that support both fine- and coarse-grained computer vision tasks. To address this limitation, we introduce ICONIC-444 (Image Classification and OOD Detection with Numerous Intricate Complexities), a specialized large-scale industrial image dataset containing over 3.1 million RGB images spanning 444 classes tailored for OOD detection research. Captured with a prototype industrial sorting machine, ICONIC-444 closely mimics real-world tasks. It complements existing datasets by offering structured, diverse data suited for rigorous OOD evaluation across a spectrum of task complexities. We define four reference tasks within ICONIC-444 to benchmark and advance OOD detection research and provide baseline results for 22 state-of-the-art post-hoc OOD detection methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes