Performance Analysis of Deep Learning Models for Femur Segmentation in MRI Scan
This work addresses femur segmentation in MRI scans for medical imaging applications, but it is incremental as it compares existing methods on a specific dataset.
The study evaluated and compared three CNN-based models (U-Net, Attention U-Net, U-KAN) and one transformer-based model (SAM 2) for segmenting femur bone structures in MRI scans, finding that Attention U-Net achieved the highest overall Dice Similarity Coefficient scores ranging from 0.932 to 0.954.
Convolutional neural networks like U-Net excel in medical image segmentation, while attention mechanisms and KAN enhance feature extraction. Meta's SAM 2 uses Vision Transformers for prompt-based segmentation without fine-tuning. However, biases in these models impact generalization with limited data. In this study, we systematically evaluate and compare the performance of three CNN-based models, i.e., U-Net, Attention U-Net, and U-KAN, and one transformer-based model, i.e., SAM 2 for segmenting femur bone structures in MRI scan. The dataset comprises 11,164 MRI scans with detailed annotations of femoral regions. Performance is assessed using the Dice Similarity Coefficient, which ranges from 0.932 to 0.954. Attention U-Net achieves the highest overall scores, while U-KAN demonstrated superior performance in anatomical regions with a smaller region of interest, leveraging its enhanced learning capacity to improve segmentation accuracy.