OmicsCL: Unsupervised Contrastive Learning for Cancer Subtype Discovery and Survival Stratification
This work addresses the problem of personalized medicine by enabling unsupervised discovery of clinically relevant cancer subtypes from heterogeneous omics data, representing an incremental improvement through the integration of survival-aware contrastive learning.
The paper tackled unsupervised cancer subtype discovery from multi-omics data by introducing OmicsCL, a contrastive learning framework with a survival-aware loss, which achieved strong unsupervised concordance with patient survival on the TCGA BRCA dataset.
Unsupervised learning of disease subtypes from multi-omics data presents a significant opportunity for advancing personalized medicine. We introduce OmicsCL, a modular contrastive learning framework that jointly embeds heterogeneous omics modalities-such as gene expression, DNA methylation, and miRNA expression-into a unified latent space. Our method incorporates a survival-aware contrastive loss that encourages the model to learn representations aligned with survival-related patterns, without relying on labeled outcomes. Evaluated on the TCGA BRCA dataset, OmicsCL uncovers clinically meaningful clusters and achieves strong unsupervised concordance with patient survival. The framework demonstrates robustness across hyperparameter configurations and can be tuned to prioritize either subtype coherence or survival stratification. Ablation studies confirm that integrating survival-aware loss significantly enhances the predictive power of learned embeddings. These results highlight the promise of contrastive objectives for biological insight discovery in high-dimensional, heterogeneous omics data.