LGCOMP-PHJul 1, 2025

Tensor Decomposition Networks for Fast Machine Learning Interatomic Potential Computations

arXiv:2507.01131v3h-index: 10Has Code
Originality Highly original
AI Analysis

This work addresses the bottleneck of slow computations in interatomic potential models for computational chemistry, offering a plug-and-play acceleration method with proven error bounds and universality.

The paper tackles the computational expense of Clebsch-Gordan tensor products in SO(3)-equivariant networks for machine learning interatomic potentials by developing tensor decomposition networks (TDNs) that replace these products with low-rank decompositions, reducing complexity from O(L^6) to O(L^4) and achieving competitive performance on datasets like PubChemQCR, OC20, and OC22.

$\rm{SO}(3)$-equivariant networks are the dominant models for machine learning interatomic potentials (MLIPs). The key operation of such networks is the Clebsch-Gordan (CG) tensor product, which is computationally expensive. To accelerate the computation, we develop tensor decomposition networks (TDNs) as a class of approximately equivariant networks in which CG tensor products are replaced by low-rank tensor decompositions, such as the CANDECOMP/PARAFAC (CP) decomposition. With the CP decomposition, we prove (i) a uniform bound on the induced error of $\rm{SO}(3)$-equivariance, and (ii) the universality of approximating any equivariant bilinear map. To further reduce the number of parameters, we propose path-weight sharing that ties all multiplicity-space weights across the $\mathcal{O}(L^3)$ CG paths into a single path without compromising equivariance, where $L$ is the maximum angular degree. The resulting layer acts as a plug-and-play replacement for tensor products in existing networks, and the computational complexity of tensor products is reduced from $\mathcal{O}(L^6)$ to $\mathcal{O}(L^4)$. We evaluate TDNs on PubChemQCR, a newly curated molecular relaxation dataset containing 105 million DFT-calculated snapshots. We also use existing datasets, including OC20, and OC22. Results show that TDNs achieve competitive performance with dramatic speedup in computations. Our code is publicly available as part of the AIRS library (\href{https://github.com/divelab/AIRS/tree/main/OpenMol/TDN}{https://github.com/divelab/AIRS/}).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes