CV · Nov 26, 2021

Modeling Annotator Preference and Stochastic Annotation Error for Medical Image Segmentation

arXiv:2111.13410v3 · 29 citations
Originality: Incremental advance
AI Analysis

This work addresses annotation variability in medical imaging, which can affect model reliability, though its improvement in segmentation accuracy is incremental.

The paper tackles the problem of annotator-related biases in medical image segmentation by proposing a framework that disentangles annotator preferences from stochastic errors, achieving promising performance on five segmentation tasks across two benchmarks.

Manual annotation of medical images is highly subjective, which inevitably leads to large annotation biases. Deep learning models may surpass human performance on a variety of tasks, but they may also mimic or amplify these biases. Although multiple annotators can be used and their annotations fused to reduce stochastic errors, this strategy cannot handle the bias caused by annotators' preferences. In this paper, we highlight the issue of annotator-related biases in medical image segmentation tasks and propose a Preference-involved Annotation Distribution Learning (PADL) framework to address it. PADL uses distribution learning to disentangle an annotator's preference from stochastic errors, so that it produces not only a meta segmentation but also the segmentation that each annotator would likely make. Under this framework, a stochastic error modeling (SEM) module estimates the meta segmentation map and the average stochastic error map, and a series of human preference modeling (HPM) modules estimate each annotator's segmentation and the corresponding stochastic error. We evaluated our PADL framework on two medical image benchmarks with different imaging modalities, both annotated by multiple medical professionals, and achieved promising performance on all five medical image segmentation tasks.
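
To make the output structure described above concrete, here is a minimal toy sketch of one "meta" distribution plus one distribution per annotator, each pairing a segmentation map with a stochastic error map. It is not the paper's implementation. The per-pixel Gaussian form, the fixed error value, and the scalar `preference_offsets` are all assumptions made for illustration, standing in for the learned SEM and HPM modules.

```python
import math
import random
from dataclasses import dataclass
from typing import List, Tuple

Map = List[List[float]]  # tiny 2-D "image" of per-pixel values


@dataclass
class PixelDistribution:
    """Assumed per-pixel Gaussian: a mean logit map plus an error (std) map."""
    mean: Map
    std: Map

    def sample(self, rng: random.Random) -> Map:
        # Draw one plausible annotation (as logits) from the distribution.
        return [[rng.gauss(m, s) for m, s in zip(mean_row, std_row)]
                for mean_row, std_row in zip(self.mean, self.std)]


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def binarize(logits: Map, threshold: float = 0.5) -> List[List[int]]:
    # Turn logits into a binary mask by thresholding the sigmoid probability.
    return [[1 if sigmoid(v) >= threshold else 0 for v in row] for row in logits]


def toy_padl_outputs(
    logits: Map, preference_offsets: List[float], error: float = 0.1
) -> Tuple[PixelDistribution, List[PixelDistribution]]:
    """Hypothetical stand-in for the SEM and HPM heads.

    The "meta" output plays the role of the SEM estimate; each annotator
    output shifts the logits by a scalar offset to mimic a systematic
    preference (e.g. consistently tighter or looser boundaries).
    """
    def flat(value: float) -> Map:
        return [[value for _ in row] for row in logits]

    meta = PixelDistribution(mean=logits, std=flat(error))
    annotators = [
        PixelDistribution(
            mean=[[v + offset for v in row] for row in logits],
            std=flat(error),
        )
        for offset in preference_offsets
    ]
    return meta, annotators


logits = [[-2.0, 0.2], [0.3, 2.0]]
meta, annotators = toy_padl_outputs(logits, preference_offsets=[-0.5, 0.5])
print(binarize(meta.mean))           # [[0, 1], [1, 1]]
print(binarize(annotators[0].mean))  # [[0, 0], [0, 1]] (conservative annotator)
print(binarize(annotators[1].mean))  # [[0, 1], [1, 1]] (liberal annotator)
```

In this toy setup the conservative annotator's mask differs from the meta mask on the two uncertain pixels. That is the kind of systematic, preference-driven disagreement that fusing annotations cannot remove, whereas the `std` maps capture only the random component.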

Code Implementations: 1 repo