CVSep 5, 2023

Augmenting Chest X-ray Datasets with Non-Expert Annotations

arXiv:2309.02244v36 citationsh-index: 27Has Code
Originality Incremental advance
AI Analysis

This work addresses dataset quality issues for medical AI researchers, but it is incremental as it builds on existing crowdsourcing and annotation strategies.

The paper tackles the problem of dataset biases in medical image analysis by augmenting chest X-ray datasets with non-expert annotations of tubes, collecting 3.5k and 1k annotations for two datasets, and shows that a detector trained on these annotations generalizes well to expert labels with moderate to almost perfect agreement.

The advancement of machine learning algorithms in medical image analysis requires the expansion of training datasets. A popular and cost-effective approach is automated annotation extraction from free-text medical reports, primarily due to the high costs associated with expert clinicians annotating medical images, such as chest X-rays. However, it has been shown that the resulting datasets are susceptible to biases and shortcuts. Another strategy to increase the size of a dataset is crowdsourcing, a widely adopted practice in general computer vision with some success in medical image analysis. In a similar vein to crowdsourcing, we enhance two publicly available chest X-ray datasets by incorporating non-expert annotations. However, instead of using diagnostic labels, we annotate shortcuts in the form of tubes. We collect 3.5k chest drain annotations for NIH-CXR14, and 1k annotations for four different tube types in PadChest, and create the Non-Expert Annotations of Tubes in X-rays (NEATX) dataset. We train a chest drain detector with the non-expert annotations that generalizes well to expert labels. Moreover, we compare our annotations to those provided by experts and show "moderate" to "almost perfect" agreement. Finally, we present a pathology agreement study to raise awareness about the quality of ground truth annotations. We make our dataset available on Zenodo at https://zenodo.org/records/14944064 and our code available at https://github.com/purrlab/chestxr-label-reliability.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes