CHEM-PH | AI | Jan 2, 2025

Constructing and explaining machine learning models for chemistry: example of the exploration and design of boron-based Lewis acids

arXiv:2501.01576v3 | h-index: 4
Originality: Incremental advance
AI Analysis

This work addresses the problem of interpretability in molecular design for chemists, offering actionable insights into chemical reactivity, though it is incremental in applying explainable AI to a specific domain.

The study tackled the challenge of designing boron-based Lewis acids with targeted properties by developing interpretable machine learning models that accurately predict Fluoride Ion Affinity (mean absolute error < 6 kJ/mol), surpassing black-box models in low-data regimes.

The integration of machine learning (ML) into chemistry offers transformative potential in the design of molecules with targeted properties. However, the focus has often been on creating highly efficient predictive models, sometimes at the expense of interpretability. In this study, we leverage explainable AI techniques to explore the rational design of boron-based Lewis acids, which play a pivotal role in organic reactions due to their electron-accepting properties. Using Fluoride Ion Affinity as a proxy for Lewis acidity, we developed interpretable ML models based on chemically meaningful descriptors, including ab initio computed features and substituent-based parameters derived from the Hammett linear free-energy relationship. By constraining the chemical space to well-defined molecular scaffolds, we achieved highly accurate predictions (mean absolute error < 6 kJ/mol), surpassing conventional black-box deep learning models in low-data regimes. Interpretability analyses of the models shed light on the origin of Lewis acidity in these compounds and identified actionable levers to modulate it through the nature and positioning of substituents on the molecular scaffold. This work bridges ML and the chemist's way of thinking, demonstrating how explainable models can inspire molecular design and enhance scientific understanding of chemical reactivity.
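
As a rough illustration of the kind of interpretable, descriptor-based model the abstract describes, the sketch below fits a one-descriptor Hammett-style line (property = intercept + rho * sum of sigma) and reports its mean absolute error. It is not the authors' model: the paper uses several descriptors, including ab initio features, whereas this uses a single summed para-substituent constant. The Fluoride Ion Affinity values are synthetic placeholders generated from an assumed slope and intercept, the molecule list and the names `descriptor`, `fit_line`, and `mae` are hypothetical, and the sigma constants are commonly tabulated values that should be checked against a primary source before any real use.

```python
from statistics import mean

# Commonly tabulated Hammett para constants (assumption: verify against a
# primary compilation before using them for real work).
SIGMA_PARA = {
    "H": 0.00, "CH3": -0.17, "OCH3": -0.27, "F": 0.06,
    "Cl": 0.23, "CF3": 0.54, "CN": 0.66, "NO2": 0.78,
}


def descriptor(substituents):
    """Sum the sigma_para constants of the substituents on a fixed scaffold."""
    return sum(SIGMA_PARA[s] for s in substituents)


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x (closed form)."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def mae(y_true, y_pred):
    """Mean absolute error between two equal-length sequences."""
    return mean(abs(a - b) for a, b in zip(y_true, y_pred))


# Hypothetical scaffold with three para positions; each tuple is one molecule.
molecules = [
    ("H", "H", "H"), ("F", "F", "F"), ("CH3", "CH3", "CH3"),
    ("OCH3", "H", "H"), ("CF3", "CF3", "H"), ("NO2", "H", "H"),
    ("CN", "CN", "CN"), ("Cl", "Cl", "H"),
]

# SYNTHETIC targets, not measured or computed FIA values: an assumed linear
# trend (kJ/mol) plus fixed small offsets standing in for scatter.
ASSUMED_RHO = 40.0
ASSUMED_INTERCEPT = 350.0
offsets = [1.5, -2.0, 0.5, -1.0, 2.0, -0.5, 1.0, -1.5]

xs = [descriptor(m) for m in molecules]
ys = [ASSUMED_INTERCEPT + ASSUMED_RHO * x + e for x, e in zip(xs, offsets)]

rho, intercept = fit_line(xs, ys)
preds = [intercept + rho * x for x in xs]
error = mae(ys, preds)

# rho is directly readable: the change in the target per unit of summed sigma.
print(f"rho = {rho:.2f} kJ/mol per sigma unit, intercept = {intercept:.2f} kJ/mol")
print(f"training MAE = {error:.2f} kJ/mol")
```

The appeal of such a model is that the fitted slope has a direct chemical reading (how strongly electron-withdrawing substituents raise the predicted affinity), which is the sort of actionable lever the abstract refers to. A real study would evaluate the error on held-out molecules rather than on the training set.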

Code Implementations: 1 repo