CVApr 16

Beyond Augmentation: Cross-Modal Transformer Fusion with Bi-directional Attention for Low-Data Aneurysm Screening

arXiv:2512.2218517.0

Predicted impact top 93% in CV · last 90 daysOriginality Incremental advance

AI Analysis

For radiologists, this provides an interpretable, anatomically grounded screening tool that maintains high performance even with limited data, addressing key clinical bottlenecks in aneurysm detection.

CMTF-Net reframes aneurysm screening as anatomically structured reasoning by supervising 14 vascular territories independently, achieving near-perfect AUC-ROC with narrow confidence intervals and sustained precision under class imbalance, enabling interpretable screening in low-data settings.

Intracranial aneurysm rupture causes subarachnoid hemorrhage with mortality near 50%, making early detection critical. Although CTA enables rapid screening, detecting small aneurysms within the complex three-dimensional branching of the Circle of Willis remains expertise-dependent. Existing automated systems are constrained by class imbalance, skull-base artifacts that mimic vascular contrast, and reliance on global binary classification without structured localization, limiting surgical relevance and interpretability. We propose CMTF-Net, a cross-modal target fusion framework that reframes aneurysm screening as anatomically structured reasoning. By supervising 14 vascular territories independently, the network encodes Circle of Willis geometry while allowing multi-segment activation, aligning model design with clinical workflow. CMTF-Net achieves near-perfect AUC-ROC with narrow confidence intervals and sustained precision under imbalance. Grad-CAM and causal maps show spatially localized activation along major arteries, supporting interpretable, anatomically grounded screening in low-data settings.

View on arXiv PDF

Similar