LGAICOMP-PHJun 23, 2022

Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs

arXiv:2206.11990v2381 citationsh-index: 23
Originality Incremental advance
AI Analysis

This work addresses the problem of improving Transformer performance for 3D molecular data, which is incremental as it builds on existing methods with specific modifications.

The paper tackles the challenge of applying Transformer networks to 3D atomistic graphs by introducing Equiformer, which incorporates SE(3)/E(3)-equivariant features and a novel attention mechanism, achieving competitive results on QM9, MD17, and OC20 datasets.

Despite their widespread success in various domains, Transformer networks have yet to perform well across datasets in the domain of 3D atomistic graphs such as molecules even when 3D-related inductive biases like translational invariance and rotational equivariance are considered. In this paper, we demonstrate that Transformers can generalize well to 3D atomistic graphs and present Equiformer, a graph neural network leveraging the strength of Transformer architectures and incorporating SE(3)/E(3)-equivariant features based on irreducible representations (irreps). First, we propose a simple and effective architecture by only replacing original operations in Transformers with their equivariant counterparts and including tensor products. Using equivariant operations enables encoding equivariant information in channels of irreps features without complicating graph structures. With minimal modifications to Transformers, this architecture has already achieved strong empirical results. Second, we propose a novel attention mechanism called equivariant graph attention, which improves upon typical attention in Transformers through replacing dot product attention with multi-layer perceptron attention and including non-linear message passing. With these two innovations, Equiformer achieves competitive results to previous models on QM9, MD17 and OC20 datasets.

Code Implementations4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes