OrgFlow: Generative Modeling of Organic Crystal Structures from Molecular Graphs

arXiv:2602.20195v1
AI Analysis

This addresses a gap in materials science for organic crystals, which are important for pharmaceuticals and functional materials, representing a novel method for a known bottleneck.

The paper tackles the problem of predicting organic crystal structures, which is challenging due to larger unit cells and strict chemical connectivity, by introducing a flow-matching model that achieves a Match Rate more than 10 times higher than existing baselines with fewer sampling steps.

Crystal structure prediction is a long-standing challenge in materials science, with most data-driven methods developed for inorganic systems. This leaves an important gap for organic crystals, which are central to pharmaceuticals, polymers, and functional materials, but present unique challenges, such as larger unit cells and strict chemical connectivity. We introduce a flow-matching model for predicting organic crystal structures directly from molecular graphs. The architecture integrates molecular connectivity with periodic boundary conditions while preserving the symmetries of crystalline systems. A bond-aware loss guides the model toward realistic local chemistry by enforcing distributions of bond lengths and connectivity. To support reliable and efficient training, we built a curated dataset of organic crystals, along with a preprocessing pipeline that precomputes bonds and edges, substantially reducing computational overhead during both training and inference. Experiments show that our method achieves a Match Rate more than 10 times higher than existing baselines while requiring fewer sampling steps for inference. These results establish generative modeling as a practical and scalable framework for organic crystal structure prediction.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes