LGQMMLNov 21, 2017

Training large margin host-pathogen protein-protein interaction predictors

arXiv:1711.07886v123 citations
Originality Incremental advance
AI Analysis

This work tackles the problem of predicting host-pathogen interactions to aid drug discovery for infectious diseases, but it is incremental as it builds on existing computational methods with specific improvements.

The study developed large margin machine learning models to predict host-pathogen protein-protein interactions (HPIs), addressing challenges in negative sample selection, size, and margin violation penalties, and introduced a weighted SVM method with a web server called HoPItor for human-viral protein interaction prediction.

Detection of protein-protein interactions (PPIs) plays a vital role in molecular biology. Particularly, infections are caused by the interactions of host and pathogen proteins. It is important to identify host-pathogen interactions (HPIs) to discover new drugs to counter infectious diseases. Conventional wet lab PPI prediction techniques have limitations in terms of large scale application and budget. Hence, computational approaches are developed to predict PPIs. This study aims to develop large margin machine learning models to predict interspecies PPIs with a special interest in host-pathogen protein interactions (HPIs). Especially, we focus on seeking answers to three queries that arise while developing an HPI predictor. 1) How should we select negative samples? 2) What should be the size of negative samples as compared to the positive samples? 3) What type of margin violation penalty should be used to train the predictor? We compare two available methods for negative sampling. Moreover, we propose a new method of assigning weights to each training example in weighted SVM depending on the distance of the negative examples from the positive examples. We have also developed a web server for our HPI predictor called HoPItor (Host Pathogen Interaction predicTOR) that can predict interactions between human and viral proteins. This webserver can be accessed at the URL: http://faculty.pieas.edu.pk/fayyaz/software.html#HoPItor.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes