IV CV LGApr 4, 2022

Feature robustness and sex differences in medical imaging: a case study in MRI-based Alzheimer's disease detection

Eike Petersen, Aasa Feragen, Maria Luise da Costa Zemsch, Anders Henriksen, Oskar Eiler Wiese Christensen, Melanie Ganz

arXiv:2204.01737v320.232 citationsh-index: 23Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses robustness and sex differences in medical imaging for Alzheimer's disease detection, highlighting the value of manual features, but it is incremental as it builds on existing methods in a specific domain.

The study compared logistic regression with manually selected volumetric features and a convolutional neural network (CNN) on MRI data for Alzheimer's disease detection, finding that logistic regression outperformed the CNN and was robust to dataset composition, while CNN performance improved for both sexes with more female training data.

Convolutional neural networks have enabled significant improvements in medical image-based diagnosis. It is, however, increasingly clear that these models are susceptible to performance degradation when facing spurious correlations and dataset shift, leading, e.g., to underperformance on underrepresented patient groups. In this paper, we compare two classification schemes on the ADNI MRI dataset: a simple logistic regression model using manually selected volumetric features, and a convolutional neural network trained on 3D MRI data. We assess the robustness of the trained models in the face of varying dataset splits, training set sex composition, and stage of disease. In contrast to earlier work in other imaging modalities, we do not observe a clear pattern of improved model performance for the majority group in the training dataset. Instead, while logistic regression is fully robust to dataset composition, we find that CNN performance is generally improved for both male and female subjects when including more female subjects in the training dataset. We hypothesize that this might be due to inherent differences in the pathology of the two sexes. Moreover, in our analysis, the logistic regression model outperforms the 3D CNN, emphasizing the utility of manual feature specification based on prior knowledge, and the need for more robust automatic feature selection.

View on arXiv PDF Code

Similar