MN LG SI MLSep 29, 2020

Incorporating network based protein complex discovery into automated model construction

Paul Scherer, Maja Trȩbacz, Nikola Simidjievski, Zohreh Shams, Helena Andres Terre, Pietro Liò, Mateja Jamnik

arXiv:2010.00387v11.2

Originality Incremental advance

AI Analysis

This work addresses the challenge of integrating protein-protein interaction networks into machine learning models for cancer research, representing an incremental improvement with domain-specific applications.

The authors tackled the problem of analyzing cancer phenotypes by incorporating network biology knowledge into computational graph construction, resulting in a method that outperformed SVM, Fully-Connected MLP, and Randomly-Connected MLPs in all tasks.

We propose a method for gene expression based analysis of cancer phenotypes incorporating network biology knowledge through unsupervised construction of computational graphs. The structural construction of the computational graphs is driven by the use of topological clustering algorithms on protein-protein networks which incorporate inductive biases stemming from network biology research in protein complex discovery. This structurally constrains the hypothesis space over the possible computational graph factorisation whose parameters can then be learned through supervised or unsupervised task settings. The sparse construction of the computational graph enables the differential protein complex activity analysis whilst also interpreting the individual contributions of genes/proteins involved in each individual protein complex. In our experiments analysing a variety of cancer phenotypes, we show that the proposed methods outperform SVM, Fully-Connected MLP, and Randomly-Connected MLPs in all tasks. Our work introduces a scalable method for incorporating large interaction networks as prior knowledge to drive the construction of powerful computational models amenable to introspective study.

View on arXiv PDF

Similar