LGAIQMJun 17, 2022

Transformer Neural Networks Attending to Both Sequence and Structure for Protein Prediction Tasks

arXiv:2206.11057v15 citationsh-index: 29
Originality Incremental advance
AI Analysis

This work addresses protein function prediction for bioinformatics, but it is incremental as it builds on existing transformer methods by incorporating structure.

The paper tackles the problem of linking protein sequence to function by proposing a transformer neural network that attends to both sequence and tertiary structure, showing that joint representations yield better performance on superfamily membership across various metrics.

The increasing number of protein sequences decoded from genomes is opening up new avenues of research on linking protein sequence to function with transformer neural networks. Recent research has shown that the number of known protein sequences supports learning useful, task-agnostic sequence representations via transformers. In this paper, we posit that learning joint sequence-structure representations yields better representations for function-related prediction tasks. We propose a transformer neural network that attends to both sequence and tertiary structure. We show that such joint representations are more powerful than sequence-based representations only, and they yield better performance on superfamily membership across various metrics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes