Reproducing and Improving CheXNet: Deep Learning for Chest X-ray Disease Classification
This work addresses disease classification in medical imaging for healthcare applications, but it is incremental as it builds on existing methods.
The researchers tackled the problem of classifying 14 diseases from chest X-ray images by reproducing and improving upon the CheXNet algorithm, achieving an average AUC-ROC score of 0.85 and an average F1 score of 0.39 across all diseases.
Deep learning for radiologic image analysis is a rapidly growing field in biomedical research and is likely to become a standard practice in modern medicine. On the publicly available NIH ChestX-ray14 dataset, containing X-ray images that are classified by the presence or absence of 14 different diseases, we reproduced an algorithm known as CheXNet, as well as explored other algorithms that outperform CheXNet's baseline metrics. Model performance was primarily evaluated using the F1 score and AUC-ROC, both of which are critical metrics for imbalanced, multi-label classification tasks in medical imaging. The best model achieved an average AUC-ROC score of 0.85 and an average F1 score of 0.39 across all 14 disease classifications present in the dataset.