PRISMat: Policy-Driven, Permutation-Invariant Autoregressive Material Generation

arXiv:2605.1661260.3
AI Analysis

For materials scientists, PRISMat offers a computationally efficient alternative to LLMs for high-throughput material discovery, achieving superior accuracy on surface property prediction.

PRISMat is a cost-effective, permutation-invariant model for generating crystal slabs conditioned on surface properties, outperforming LLMs with 4× lower error (0.188 eV/Ų for cleavage energy, 2.79 eV for work function) while requiring less inference time.

Rapid identification of candidate materials with target properties has become a key task in materials science. Machine learning has emerged as an alternative to physics-based simulation, offering a faster and cheaper way to filter materials based on their stability and other target properties, reducing the number of candidates that reach the costly synthesis stage. Recently, Large Language Models (LLMs) have been applied to this role, but these models are parameter-heavy and computationally expensive both during training and at inference time, making them unsuitable for high-throughput tasks. This inefficiency stems from both the large over-parameterization of language models and the difficulty of framing material generation as a sequence learning problem. In this paper, we present PRISMat, a cost-effective, permutation-invariant model, which addresses these limitations. We show that PRISMat, despite taking less time for inference, is able to outperform LLMs in generating crystal slabs conditioned on critical materials' surface properties. In targeted material discovery, we achieve mean absolute errors of 0.188 eV/A$^2$ and 2.79 eV for cleavage energy and work function tasks, respectively, reducing the error of the next best model by 4$\times$.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes