QMLGIVJan 25, 2023

Gene-SGAN: a method for discovering disease subtypes with imaging and genetic signatures via multi-view weakly-supervised deep clustering

arXiv:2301.10772v14 citationsh-index: 225
Originality Incremental advance
AI Analysis

This addresses the challenge of precision diagnosis and treatment for neurologic and neuropsychiatric diseases by linking imaging-derived subtypes to genetic drivers, though it is incremental as it builds on existing multi-view and weakly-supervised clustering approaches.

The authors tackled the problem of disease heterogeneity by developing Gene-SGAN, a multi-view weakly-supervised deep clustering method that jointly analyzes phenotypic and genetic data to discover disease subtypes with genetic correlations, applied to real datasets from 28,858 individuals to derive subtypes for Alzheimer's disease and brain endophenotypes for hypertension.

Disease heterogeneity has been a critical challenge for precision diagnosis and treatment, especially in neurologic and neuropsychiatric diseases. Many diseases can display multiple distinct brain phenotypes across individuals, potentially reflecting disease subtypes that can be captured using MRI and machine learning methods. However, biological interpretability and treatment relevance are limited if the derived subtypes are not associated with genetic drivers or susceptibility factors. Herein, we describe Gene-SGAN - a multi-view, weakly-supervised deep clustering method - which dissects disease heterogeneity by jointly considering phenotypic and genetic data, thereby conferring genetic correlations to the disease subtypes and associated endophenotypic signatures. We first validate the generalizability, interpretability, and robustness of Gene-SGAN in semi-synthetic experiments. We then demonstrate its application to real multi-site datasets from 28,858 individuals, deriving subtypes of Alzheimer's disease and brain endophenotypes associated with hypertension, from MRI and SNP data. Derived brain phenotypes displayed significant differences in neuroanatomical patterns, genetic determinants, biological and clinical biomarkers, indicating potentially distinct underlying neuropathologic processes, genetic drivers, and susceptibility factors. Overall, Gene-SGAN is broadly applicable to disease subtyping and endophenotype discovery, and is herein tested on disease-related, genetically-driven neuroimaging phenotypes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes