LGCHEM-PHBMNov 23, 2022

Learning Regularized Positional Encoding for Molecular Prediction

arXiv:2211.12773v12 citationsh-index: 47
Originality Incremental advance
AI Analysis

This work addresses molecular modeling for computational chemistry, offering an incremental improvement over manual design methods.

The paper tackled the problem of predicting molecular properties by proposing a learnable, regularized positional encoding for interatomic distances and bond angles, resulting in improved performance across three model architectures and enabling easier physics-based interpretation.

Machine learning has become a promising approach for molecular modeling. Positional quantities, such as interatomic distances and bond angles, play a crucial role in molecule physics. The existing works rely on careful manual design of their representation. To model the complex nonlinearity in predicting molecular properties in an more end-to-end approach, we propose to encode the positional quantities with a learnable embedding that is continuous and differentiable. A regularization technique is employed to encourage embedding smoothness along the physical dimension. We experiment with a variety of molecular property and force field prediction tasks. Improved performance is observed for three different model architectures after plugging in the proposed positional encoding method. In addition, the learned positional encoding allows easier physics-based interpretation. We observe that tasks of similar physics have the similar learned positional encoding.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes