MTRL-SCILGDec 10, 2025

Transport Novelty Distance: A Distributional Metric for Evaluating Material Generative Models

arXiv:2512.09514v11 citationsh-index: 3
Originality Incremental advance
AI Analysis

It addresses the need for rigorous evaluation metrics in material generative models, which is incremental as it builds on existing methods but introduces a novel metric.

The paper tackles the problem of evaluating generative models for materials discovery by introducing the Transport Novelty Distance (TNovD), a metric that jointly assesses quality and novelty of generated structures, validated on datasets like MP20 and WBM substitution to detect memorized and low-quality data.

Recent advances in generative machine learning have opened new possibilities for the discovery and design of novel materials. However, as these models become more sophisticated, the need for rigorous and meaningful evaluation metrics has grown. Existing evaluation approaches often fail to capture both the quality and novelty of generated structures, limiting our ability to assess true generative performance. In this paper, we introduce the Transport Novelty Distance (TNovD) to judge generative models used for materials discovery jointly by the quality and novelty of the generated materials. Based on ideas from Optimal Transport theory, TNovD uses a coupling between the features of the training and generated sets, which is refined into a quality and memorization regime by a threshold. The features are generated from crystal structures using a graph neural network that is trained to distinguish between materials, their augmented counterparts, and differently sized supercells using contrastive learning. We evaluate our proposed metric on typical toy experiments relevant for crystal structure prediction, including memorization, noise injection and lattice deformations. Additionally, we validate the TNovD on the MP20 validation set and the WBM substitution dataset, demonstrating that it is capable of detecting both memorized and low-quality material data. We also benchmark the performance of several popular material generative models. While introduced for materials, our TNovD framework is domain-agnostic and can be adapted for other areas, such as images and molecules.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes