MTRL-SCIJun 14, 2023
A generative artificial intelligence framework based on a molecular diffusion model for the design of metal-organic frameworks for carbon captureHyun Park, Xiaoli Yan, Ruijie Zhu et al.
Metal-organic frameworks (MOFs) exhibit great promise for CO2 capture. However, finding the best performing materials poses computational and experimental grand challenges in view of the vast chemical space of potential building blocks. Here, we introduce GHP-MOFassemble, a generative artificial intelligence (AI), high performance framework for the rational and accelerated design of MOFs with high CO2 adsorption capacity and synthesizable linkers. GHP-MOFassemble generates novel linkers, assembled with one of three pre-selected metal nodes (Cu paddlewheel, Zn paddlewheel, Zn tetramer) into MOFs in a primitive cubic topology. GHP-MOFassemble screens and validates AI-generated MOFs for uniqueness, synthesizability, structural validity, uses molecular dynamics simulations to study their stability and chemical consistency, and crystal graph neural networks and Grand Canonical Monte Carlo simulations to quantify their CO2 adsorption capacities. We present the top six AI-generated MOFs with CO2 capacities greater than 2 $m mol/g$, i.e., higher than 96.9% of structures in the hypothetical MOF dataset.
DCJan 18, 2025Code
MOFA: Discovering Materials for Carbon Capture with a GenAI- and Simulation-Based WorkflowXiaoli Yan, Nathaniel Hudson, Hyun Park et al.
We present MOFA, an open-source generative AI (GenAI) plus simulation workflow for high-throughput generation of metal-organic frameworks (MOFs) on large-scale high-performance computing (HPC) systems. MOFA addresses key challenges in integrating GPU-accelerated computing for GPU-intensive GenAI tasks, including distributed training and inference, alongside CPU- and GPU-optimized tasks for screening and filtering AI-generated MOFs using molecular dynamics, density functional theory, and Monte Carlo simulations. These heterogeneous tasks are unified within an online learning framework that optimizes the utilization of available CPU and GPU resources across HPC systems. Performance metrics from a 450-node (14,400 AMD Zen 3 CPUs + 1800 NVIDIA A100 GPUs) supercomputer run demonstrate that MOFA achieves high-throughput generation of novel MOF structures, with CO$_2$ adsorption capacities ranking among the top 10 in the hypothetical MOF (hMOF) dataset. Furthermore, the production of high-quality MOFs exhibits a linear relationship with the number of nodes utilized. The modular architecture of MOFA will facilitate its integration into other scientific applications that dynamically combine GenAI with large-scale simulations.
LGJul 8, 2022
Robust Newsvendor Problem in Global Market: Stable Operation Strategy for a Two-Market Stochastic SystemXiaoli Yan
The global markets provide enterprises with selling opportunities and challenges in stabilizing operational strategies. From the perspective of production management, it is important to improve the profitability of an enterprise by exploiting the different timing of the selling season in different markets to develop an operational strategy that is optimized and configured on a global scale. This paper examines the above issue with an insightful model of selling the product to two markets (a primary and a secondary market) with multiple risks of changes in the market environment and nonoverlapping selling seasons. We refer to this problem as the "global robust newsvendor" problem. We provide closed-form solutions of the optimal operation strategy for demand-independent and demand-related scenarios for the above two market stochastic systems. The closed-form solutions fully reflect the influence of the relationship between supply and demand on strategy selection. We find that the demand correlation and the lack of demand information will not substantially affect the operation strategy, and the enterprise's industrial chain and supply chain remain stable. However, the reduction of inter-market tariffs or logistics costs will cause changes, and the existence of the secondary market will lead to more capacity planning in the primary market. In addition, our model explicitly considers the impact of exchange rate uncertainty on operating strategies.
LGSep 29, 2025
Steering an Active Learning Workflow Towards Novel Materials Discovery via Queue PrioritizationMarcus Schwarting, Logan Ward, Nathaniel Hudson et al.
Generative AI poses both opportunities and risks for solving inverse design problems in the sciences. Generative tools provide the ability to expand and refine a search space autonomously, but do so at the cost of exploring low-quality regions until sufficiently fine tuned. Here, we propose a queue prioritization algorithm that combines generative modeling and active learning in the context of a distributed workflow for exploring complex design spaces. We find that incorporating an active learning model to prioritize top design candidates can prevent a generative AI workflow from expending resources on nonsensical candidates and halt potential generative model decay. For an existing generative AI workflow for discovering novel molecular structure candidates for carbon capture, our active learning approach significantly increases the number of high-quality candidates identified by the generative model. We find that, out of 1000 novel candidates, our workflow without active learning can generate an average of 281 high-performing candidates, while our proposed prioritization with active learning can generate an average 604 high-performing candidates.