MXtalTools: A Toolkit for Machine Learning on Molecular Crystals
This toolkit addresses the need for efficient machine learning workflows in molecular crystal research, but it is incremental as it builds on existing methods by providing modular utilities.
The authors tackled the problem of data-driven modeling for molecular crystals by developing MXtalTools, a flexible Python toolkit that facilitates machine learning studies, resulting in an open-source package with CUDA acceleration for high-throughput crystal modeling.
We present MXtalTools, a flexible Python package for the data-driven modelling of molecular crystals, facilitating machine learning studies of the molecular solid state. MXtalTools comprises several classes of utilities: (1) synthesis, collation, and curation of molecule and crystal datasets, (2) integrated workflows for model training and inference, (3) crystal parameterization and representation, (4) crystal structure sampling and optimization, (5) end-to-end differentiable crystal sampling, construction and analysis. Our modular functions can be integrated into existing workflows or combined and used to build novel modelling pipelines. MXtalTools leverages CUDA acceleration to enable high-throughput crystal modelling. The Python code is available open-source on our GitHub page, with detailed documentation on ReadTheDocs.