LGAICVSep 25, 2024

Scalable Ensemble Diversification for OOD Generalization and Detection

arXiv:2409.16797v13 citationsh-index: 4Has Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of scalable ensemble training for OOD tasks, which is incremental as it builds on existing methods but improves efficiency and applicability to large-scale settings.

The paper tackles the problem of training diverse ensembles for out-of-distribution (OOD) generalization and detection by proposing a scalable method that avoids the need for OOD samples, achieving large benefits in ImageNet experiments and surpassing many OOD detection baselines.

Training a diverse ensemble of models has several practical applications such as providing candidates for model selection with better out-of-distribution (OOD) generalization, and enabling the detection of OOD samples via Bayesian principles. An existing approach to diverse ensemble training encourages the models to disagree on provided OOD samples. However, the approach is computationally expensive and it requires well-separated ID and OOD examples, such that it has only been demonstrated in small-scale settings. $\textbf{Method.}$ This work presents a method for Scalable Ensemble Diversification (SED) applicable to large-scale settings (e.g. ImageNet) that does not require OOD samples. Instead, SED identifies hard training samples on the fly and encourages the ensemble members to disagree on these. To improve scaling, we show how to avoid the expensive computations in existing methods of exhaustive pairwise disagreements across models. $\textbf{Results.}$ We evaluate the benefits of diversification with experiments on ImageNet. First, for OOD generalization, we observe large benefits from the diversification in multiple settings including output-space (classical) ensembles and weight-space ensembles (model soups). Second, for OOD detection, we turn the diversity of ensemble hypotheses into a novel uncertainty score estimator that surpasses a large number of OOD detection baselines. Code is available here: https://github.com/AlexanderRubinstein/diverse-universe-public.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes