LGNov 6, 2015

Evaluating Protein-protein Interaction Predictors with a Novel 3-Dimensional Metric

arXiv:1511.02196v11 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the need for biologists to have reliable, high-precision predictions for protein-protein interactions, though it is incremental as it introduces a new evaluation metric rather than a new prediction method.

The paper tackles the problem of evaluating protein-protein interaction predictors by proposing a novel 3-dimensional metric that focuses on high precision for adoption by biologists, and it shows that this metric successfully evaluates classifiers and datasets where traditional metrics like ROC and precision-recall curves fail.

In order for the predicted interactions to be directly adopted by biologists, the ma- chine learning predictions have to be of high precision, regardless of recall. This aspect cannot be evaluated or numerically represented well by traditional metrics like accuracy, ROC, or precision-recall curve. In this work, we start from the alignment in sensitivity of ROC and recall of precision-recall curve, and propose an evaluation metric focusing on the ability of a model to be adopted by biologists. This metric evaluates the ability of a machine learning algorithm to predict only new interactions, meanwhile, it eliminates the influence of test dataset. In the experiment of evaluating different classifiers with a same data set and evaluating the same predictor with different datasets, our new metric fulfills the evaluation task of our interest while two widely recognized metrics, ROC and precision-recall curve fail the tasks for different reasons.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes