IV CV LGDec 23, 2024

Enhancing Reconstruction-Based Out-of-Distribution Detection in Brain MRI with Model and Metric Ensembles

Evi M. C. Huijben, Sina Amirrajab, Josien P. W. Pluim

arXiv:2412.17586v110.36 citationsh-index: 54Has CodeComput. Methods Programs Biomed.

Originality Synthesis-oriented

AI Analysis

This addresses safety in automated medical image analysis by improving OOD detection for brain MRI, though it is incremental as it builds on existing reconstruction-based methods.

The study tackled out-of-distribution detection in brain MRI by investigating reconstruction-based autoencoders with model and metric ensembles, achieving a pixel-level area under the Precision-Recall curve of 0.66 for detecting synthetic artifacts.

Out-of-distribution (OOD) detection is crucial for safely deploying automated medical image analysis systems, as abnormal patterns in images could hamper their performance. However, OOD detection in medical imaging remains an open challenge, and we address three gaps: the underexplored potential of a simple OOD detection model, the lack of optimization of deep learning strategies specifically for OOD detection, and the selection of appropriate reconstruction metrics. In this study, we investigated the effectiveness of a reconstruction-based autoencoder for unsupervised detection of synthetic artifacts in brain MRI. We evaluated the general reconstruction capability of the model, analyzed the impact of the selected training epoch and reconstruction metrics, assessed the potential of model and/or metric ensembles, and tested the model on a dataset containing a diverse range of artifacts. Among the metrics assessed, the contrast component of SSIM and LPIPS consistently outperformed others in detecting homogeneous circular anomalies. By combining two well-converged models and using LPIPS and contrast as reconstruction metrics, we achieved a pixel-level area under the Precision-Recall curve of 0.66. Furthermore, with the more realistic OOD dataset, we observed that the detection performance varied between artifact types; local artifacts were more difficult to detect, while global artifacts showed better detection results. These findings underscore the importance of carefully selecting metrics and model configurations, and highlight the need for tailored approaches, as standard deep learning approaches do not always align with the unique needs of OOD detection.

View on arXiv PDF Code

Similar