LGBMSep 29, 2025

MarS-FM: Generative Modeling of Molecular Dynamics via Markov State Models

arXiv:2509.24779v22 citationsh-index: 13
Originality Highly original
AI Analysis

This addresses the problem of high computational cost in simulating protein dynamics for researchers in computational biology, offering a significant speedup with improved accuracy.

The paper tackles the computational expense of Molecular Dynamics (MD) simulations by introducing MarS-FM, a generative model that learns transitions across discrete states from Markov State Models, achieving over two orders of magnitude speedup compared to MD simulations while outperforming existing methods across structural metrics.

Molecular Dynamics (MD) is a powerful computational microscope for probing protein functions. However, the need for fine-grained integration and the long timescales of biomolecular events make MD computationally expensive. To address this, several generative models have been proposed to generate surrogate trajectories at lower cost. Yet, these models typically learn a fixed-lag transition density, causing the training signal to be dominated by frequent but uninformative transitions. We introduce a new class of generative models, MSM Emulators, which instead learn to sample transitions across discrete states defined by an underlying Markov State Model (MSM). We instantiate this class with Markov Space Flow Matching (MarS-FM), whose sampling offers more than two orders of magnitude speedup compared to implicit- or explicit-solvent MD simulations. We benchmark Mars-FM ability to reproduce MD statistics through structural observables such as RMSD, radius of gyration, and secondary structure content. Our evaluation spans protein domains (up to 500 residues) with significant chemical and structural diversity, including unfolding events, and enforces strict sequence dissimilarity between training and test sets to assess generalization. Across all metrics, MarS-FM outperforms existing methods, often by a substantial margin.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes