CVDec 21, 2025

Breast Cancer Recurrence Risk Prediction Based on Multiple Instance Learning

arXiv:2512.18734v1

Originality Synthesis-oriented

AI Analysis

This work addresses the clinical challenge of predicting breast cancer recurrence risk for patients, but it is incremental as it applies existing methods to a specific dataset.

This study tackled breast cancer recurrence risk prediction by developing and comparing three Multiple Instance Learning frameworks on whole-slide images, with the best model achieving a mean AUC of 0.836 and 76.2% accuracy in stratifying patients into low, medium, and high risk tiers.

Predicting breast cancer recurrence risk is a critical clinical challenge. This study investigates the potential of computational pathology to stratify patients using deep learning on routine Hematoxylin and Eosin (H&E) stained whole-slide images (WSIs). We developed and compared three Multiple Instance Learning (MIL) frameworks -- CLAM-SB, ABMIL, and ConvNeXt-MIL-XGBoost -- on an in-house dataset of 210 patient cases. The models were trained to predict 5-year recurrence risk, categorized into three tiers (low, medium, high), with ground truth labels established by the 21-gene Recurrence Score. Features were extracted using the UNI and CONCH pre-trained models. In a 5-fold cross-validation, the modified CLAM-SB model demonstrated the strongest performance, achieving a mean Area Under the Curve (AUC) of 0.836 and a classification accuracy of 76.2%. Our findings demonstrate the feasibility of using deep learning on standard histology slides for automated, genomics-correlated risk stratification, highlighting a promising pathway toward rapid and cost-effective clinical decision support.

View on arXiv PDF

Similar