MTRL-SCI LGJan 16, 2024

Structure-based out-of-distribution (OOD) materials property prediction: a benchmark study

Sadman Sadeed Omee, Nihang Fu, Rongzhi Dong, Ming Hu, Jianjun Hu

arXiv:2401.08032v110.853 citationsHas Codenpj Comput Mater

Originality Incremental advance

AI Analysis

This addresses the problem of overestimating ML model performance in materials science due to dataset redundancy, which is crucial for researchers aiming to discover novel materials.

The study benchmarks graph neural networks (GNNs) for predicting properties of out-of-distribution (OOD) materials, revealing that state-of-the-art models underperform significantly compared to baselines, with average performance gaps highlighting a generalization issue.

In real-world material research, machine learning (ML) models are usually expected to predict and discover novel exceptional materials that deviate from the known materials. It is thus a pressing question to provide an objective evaluation of ML model performances in property prediction of out-of-distribution (OOD) materials that are different from the training set distribution. Traditional performance evaluation of materials property prediction models through random splitting of the dataset frequently results in artificially high performance assessments due to the inherent redundancy of typical material datasets. Here we present a comprehensive benchmark study of structure-based graph neural networks (GNNs) for extrapolative OOD materials property prediction. We formulate five different categories of OOD ML problems for three benchmark datasets from the MatBench study. Our extensive experiments show that current state-of-the-art GNN algorithms significantly underperform for the OOD property prediction tasks on average compared to their baselines in the MatBench study, demonstrating a crucial generalization gap in realistic material prediction tasks. We further examine the latent physical spaces of these GNN models and identify the sources of CGCNN, ALIGNN, and DeeperGATGNN's significantly more robust OOD performance than those of the current best models in the MatBench study (coGN and coNGN), and provide insights to improve their performance.

View on arXiv PDF Code

Similar