QMAILGDec 2, 2022

Multiscale Graph Neural Networks for Protein Residue Contact Map Prediction

arXiv:2212.02251v22 citationsh-index: 64
Originality Incremental advance
AI Analysis

This work addresses a specific problem in protein structural analysis for bioinformatics, representing an incremental improvement over existing methods.

The paper tackled the challenge of predicting protein residue contact maps across short-, medium-, and long-range distances by proposing a multiscale graph neural network (GNN) approach combined with a recurrent neural network (RNN). Test results on the ProteinNet dataset showed improved accuracy for all contact ranges, including long-range predictions.

Machine learning (ML) is revolutionizing protein structural analysis, including an important subproblem of predicting protein residue contact maps, i.e., which amino-acid residues are in close spatial proximity given the amino-acid sequence of a protein. Despite recent progresses in ML-based protein contact prediction, predicting contacts with a wide range of distances (commonly classified into short-, medium- and long-range contacts) remains a challenge. Here, we propose a multiscale graph neural network (GNN) based approach taking a cue from multiscale physics simulations, in which a standard pipeline involving a recurrent neural network (RNN) is augmented with three GNNs to refine predictive capability for short-, medium- and long-range residue contacts, respectively. Test results on the ProteinNet dataset show improved accuracy for contacts of all ranges using the proposed multiscale RNN+GNN approach over the conventional approach, including the most challenging case of long-range contact prediction.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes