BMCLLGQMOct 21, 2024

CPE-Pro: A Structure-Sensitive Deep Learning Method for Protein Representation and Origin Evaluation

arXiv:2410.15592v22 citationsh-index: 9Has CodeInterdiscip Sci Comput Life Sci
Originality Incremental advance
AI Analysis

This work addresses the need for reliable evaluation of protein structure prediction methods in computational biology, though it appears incremental by building on existing structure prediction techniques.

The authors tackled the problem of distinguishing between experimentally resolved and computationally predicted protein structures by developing CPE-Pro, a structure-sensitive deep learning model that accurately traces the origin of structures across four data classes.

Protein structures are important for understanding their functions and interactions. Currently, many protein structure prediction methods are enriching the structure database. Discriminating the origin of structures is crucial for distinguishing between experimentally resolved and computationally predicted structures, evaluating the reliability of prediction methods, and guiding downstream biological studies. Building on works in structure prediction, We developed a structure-sensitive supervised deep learning model, Crystal vs Predicted Evaluator for Protein Structure (CPE-Pro), to represent and discriminate the origin of protein structures. CPE-Pro learns the structural information of proteins and captures inter-structural differences to achieve accurate traceability on four data classes, and is expected to be extended to more. Simultaneously, we utilized Foldseek to encode protein structures into "structure-sequences" and trained a protein Structural Sequence Language Model, SSLM. Preliminary experiments demonstrated that, compared to large-scale protein language models pre-trained on vast amounts of amino acid sequences, the "structure-sequence" enables the language model to learn more informative protein features, enhancing and optimizing structural representations. We have provided the code, model weights, and all related materials on https://github.com/GouWenrui/CPE-Pro-main.git.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes