CVSep 5, 2023

Augmenting Chest X-ray Datasets with Non-Expert Annotations

Veronika Cheplygina, Cathrine Damgaard, Trine Naja Eriksen, Dovile Juodelyte, Amelia Jiménez-Sánchez

arXiv:2309.02244v36.86 citationsh-index: 27Has Code

Originality Incremental advance

AI Analysis

This work addresses dataset quality issues for medical AI researchers, but it is incremental as it builds on existing crowdsourcing and annotation strategies.

The paper tackles the problem of dataset biases in medical image analysis by augmenting chest X-ray datasets with non-expert annotations of tubes, collecting 3.5k and 1k annotations for two datasets, and shows that a detector trained on these annotations generalizes well to expert labels with moderate to almost perfect agreement.

The advancement of machine learning algorithms in medical image analysis requires the expansion of training datasets. A popular and cost-effective approach is automated annotation extraction from free-text medical reports, primarily due to the high costs associated with expert clinicians annotating medical images, such as chest X-rays. However, it has been shown that the resulting datasets are susceptible to biases and shortcuts. Another strategy to increase the size of a dataset is crowdsourcing, a widely adopted practice in general computer vision with some success in medical image analysis. In a similar vein to crowdsourcing, we enhance two publicly available chest X-ray datasets by incorporating non-expert annotations. However, instead of using diagnostic labels, we annotate shortcuts in the form of tubes. We collect 3.5k chest drain annotations for NIH-CXR14, and 1k annotations for four different tube types in PadChest, and create the Non-Expert Annotations of Tubes in X-rays (NEATX) dataset. We train a chest drain detector with the non-expert annotations that generalizes well to expert labels. Moreover, we compare our annotations to those provided by experts and show "moderate" to "almost perfect" agreement. Finally, we present a pathology agreement study to raise awareness about the quality of ground truth annotations. We make our dataset available on Zenodo at https://zenodo.org/records/14944064 and our code available at https://github.com/purrlab/chestxr-label-reliability.

View on arXiv PDF Code

Similar