CVAIAug 29, 2025

Multimodal Deep Learning for Phyllodes Tumor Classification from Ultrasound and Clinical Data

arXiv:2509.00213v21 citationsh-index: 9BSN
Originality Incremental advance
AI Analysis

This work addresses a specific clinical problem in breast tumor diagnosis for medical practitioners, offering an incremental improvement by integrating multimodal data.

The study tackled the problem of classifying rare phyllodes tumors from breast ultrasound and clinical data to reduce unnecessary surgeries, achieving AUC-ROC scores up to 0.9427 and F1-scores up to 0.7294 with a multimodal deep learning framework.

Phyllodes tumors (PTs) are rare fibroepithelial breast lesions that are difficult to classify preoperatively due to their radiological similarity to benign fibroadenomas. This often leads to unnecessary surgical excisions. To address this, we propose a multimodal deep learning framework that integrates breast ultrasound (BUS) images with structured clinical data to improve diagnostic accuracy. We developed a dual-branch neural network that extracts and fuses features from ultrasound images and patient metadata from 81 subjects with confirmed PTs. Class-aware sampling and subject-stratified 5-fold cross-validation were applied to prevent class imbalance and data leakage. The results show that our proposed multimodal method outperforms unimodal baselines in classifying benign versus borderline/malignant PTs. Among six image encoders, ConvNeXt and ResNet18 achieved the best performance in the multimodal setting, with AUC-ROC scores of 0.9427 and 0.9349, and F1-scores of 0.6720 and 0.7294, respectively. This study demonstrates the potential of multimodal AI to serve as a non-invasive diagnostic tool, reducing unnecessary biopsies and improving clinical decision-making in breast tumor management.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes