GNLGMLMay 16, 2019

ncRNA Classification with Graph Convolutional Networks

arXiv:1905.06515v136 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of ncRNA classification for bioinformatics researchers, representing an incremental improvement over existing methods.

The paper tackles the problem of classifying non-coding RNA sequences into families by using a graph convolutional network model, achieving an accuracy of 85.73% and an F1-score of 85.61% over 13 classes.

Non-coding RNA (ncRNA) are RNA sequences which don't code for a gene but instead carry important biological functions. The task of ncRNA classification consists in classifying a given ncRNA sequence into its family. While it has been shown that the graph structure of an ncRNA sequence folding is of great importance for the prediction of its family, current methods make use of machine learning classifiers on hand-crafted graph features. We improve on the state-of-the-art for this task with a graph convolutional network model which achieves an accuracy of 85.73% and an F1-score of 85.61% over 13 classes. Moreover, our model learns in an end-to-end fashion from the raw RNA graphs and removes the need for expensive feature extraction. To the best of our knowledge, this also represents the first successful application of graph convolutional networks to RNA folding data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes