BMLGJan 25, 2023

Unsupervised Protein-Ligand Binding Energy Prediction via Neural Euler's Rotation Equation

arXiv:2301.10814v235 citationsh-index: 120
Originality Highly original
AI Analysis

This addresses the challenge of limited labeled data in drug discovery, particularly for antibodies, by introducing an unsupervised approach.

The paper tackles the problem of predicting protein-ligand binding energy without labeled data by reformulating it as a generative modeling task using SE(3) denoising score matching, achieving performance that matches supervised methods for antibody-antigen binding.

Protein-ligand binding prediction is a fundamental problem in AI-driven drug discovery. Prior work focused on supervised learning methods using a large set of binding affinity data for small molecules, but it is hard to apply the same strategy to other drug classes like antibodies as labelled data is limited. In this paper, we explore unsupervised approaches and reformulate binding energy prediction as a generative modeling task. Specifically, we train an energy-based model on a set of unlabelled protein-ligand complexes using SE(3) denoising score matching and interpret its log-likelihood as binding affinity. Our key contribution is a new equivariant rotation prediction network called Neural Euler's Rotation Equations (NERE) for SE(3) score matching. It predicts a rotation by modeling the force and torque between protein and ligand atoms, where the force is defined as the gradient of an energy function with respect to atom coordinates. We evaluate NERE on protein-ligand and antibody-antigen binding affinity prediction benchmarks. Our model outperforms all unsupervised baselines (physics-based and statistical potentials) and matches supervised learning methods in the antibody case.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes