LGDec 19, 2025

Alzheimer's Disease Brain Network Mining

arXiv:2512.17276v14.1h-index: 1

Originality Highly original

AI Analysis

This work addresses the problem of expensive and invasive clinical assessments for Alzheimer's disease diagnosis, enabling more efficient use of partially annotated neuroimaging data for clinicians and researchers.

The paper tackles the challenge of limited labeled data for Alzheimer's disease diagnosis by introducing MATCH-AD, a semi-supervised framework that integrates deep learning, graph-based label propagation, and optimal transport, achieving near-perfect diagnostic accuracy on nearly 5,000 subjects with labels for less than one-third of the data.

Machine learning approaches for Alzheimer's disease (AD) diagnosis face a fundamental challenges. Clinical assessments are expensive and invasive, leaving ground truth labels available for only a fraction of neuroimaging datasets. We introduce Multi view Adaptive Transport Clustering for Heterogeneous Alzheimer's Disease (MATCH-AD), a semi supervised framework that integrates deep representation learning, graph-based label propagation, and optimal transport theory to address this limitation. The framework leverages manifold structure in neuroimaging data to propagate diagnostic information from limited labeled samples to larger unlabeled populations, while using Wasserstein distances to quantify disease progression between cognitive states. Evaluated on nearly five thousand subjects from the National Alzheimer's Coordinating Center, encompassing structural MRI measurements from hundreds of brain regions, cerebrospinal fluid biomarkers, and clinical variables MATCHAD achieves near-perfect diagnostic accuracy despite ground truth labels for less than one-third of subjects. The framework substantially outperforms all baseline methods, achieving kappa indicating almost perfect agreement compared to weak agreement for the best baseline, a qualitative transformation in diagnostic reliability. Performance remains clinically useful even under severe label scarcity, and we provide theoretical convergence guarantees with proven bounds on label propagation error and transport stability. These results demonstrate that principled semi-supervised learning can unlock the diagnostic potential of the vast repositories of partially annotated neuroimaging data accumulating worldwide, substantially reducing annotation burden while maintaining accuracy suitable for clinical deployment.

View on arXiv PDF

Similar