CVDec 14, 2022

Modeling Multimodal Aleatoric Uncertainty in Segmentation with Mixture of Stochastic Experts

arXiv:2212.07328v213 citationsh-index: 37
Originality Incremental advance
AI Analysis

This addresses the need for reliable uncertainty estimation in safety-critical segmentation applications, representing an incremental improvement over existing methods.

The paper tackles the challenge of predicting well-calibrated aleatoric uncertainty in segmentation, particularly for ambiguous images, by proposing a mixture of stochastic experts (MoSE) model with a Wasserstein-like loss, achieving state-of-the-art or competitive performance on datasets like LIDC-IDRI and a modified Cityscapes.

Equipping predicted segmentation with calibrated uncertainty is essential for safety-critical applications. In this work, we focus on capturing the data-inherent uncertainty (aka aleatoric uncertainty) in segmentation, typically when ambiguities exist in input images. Due to the high-dimensional output space and potential multiple modes in segmenting ambiguous images, it remains challenging to predict well-calibrated uncertainty for segmentation. To tackle this problem, we propose a novel mixture of stochastic experts (MoSE) model, where each expert network estimates a distinct mode of the aleatoric uncertainty and a gating network predicts the probabilities of an input image being segmented in those modes. This yields an efficient two-level uncertainty representation. To learn the model, we develop a Wasserstein-like loss that directly minimizes the distribution distance between the MoSE and ground truth annotations. The loss can easily integrate traditional segmentation quality measures and be efficiently optimized via constraint relaxation. We validate our method on the LIDC-IDRI dataset and a modified multimodal Cityscapes dataset. Results demonstrate that our method achieves the state-of-the-art or competitive performance on all metrics.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes