BMAILGNEQMApr 25, 2016

Protein Secondary Structure Prediction Using Cascaded Convolutional and Recurrent Neural Networks

arXiv:1604.07176v1121 citations
Originality Incremental advance
AI Analysis

This addresses a key bioinformatics problem for protein structure analysis, but it is incremental as it builds on existing deep learning methods.

The paper tackled protein secondary structure prediction by proposing an end-to-end deep network combining convolutional and recurrent neural networks with multi-task learning, achieving state-of-the-art accuracies of 69.7% on CB513, 76.9% on CASP10, and 73.1% on CASP11.

Protein secondary structure prediction is an important problem in bioinformatics. Inspired by the recent successes of deep neural networks, in this paper, we propose an end-to-end deep network that predicts protein secondary structures from integrated local and global contextual features. Our deep architecture leverages convolutional neural networks with different kernel sizes to extract multiscale local contextual features. In addition, considering long-range dependencies existing in amino acid sequences, we set up a bidirectional neural network consisting of gated recurrent unit to capture global contextual features. Furthermore, multi-task learning is utilized to predict secondary structure labels and amino-acid solvent accessibility simultaneously. Our proposed deep network demonstrates its effectiveness by achieving state-of-the-art performance, i.e., 69.7% Q8 accuracy on the public benchmark CB513, 76.9% Q8 accuracy on CASP10 and 73.1% Q8 accuracy on CASP11. Our model and results are publicly available.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes