QMDCLGBMMar 16

Fold-CP: A Context Parallelism Framework for Biomolecular Modeling

arXiv:2603.1480684.1h-index: 17Has Code
AI Analysis

This addresses the problem of scaling biomolecular modeling for researchers by providing a scalable pathway to model massive systems, representing an incremental improvement in computational efficiency.

The paper tackles the hardware memory limitations in predicting large biomolecular structures by introducing Fold-CP, a context parallelism framework that distributes inference and training across multiple GPUs, enabling structure prediction of assemblies exceeding 30,000 residues on 64 GPUs and scoring over 90% of the CORUM database.

Understanding cellular machinery requires atomic-scale reconstruction of large biomolecular assemblies. However, predicting the structures of these systems has been constrained by hardware memory requirements of models like AlphaFold 3, imposing a practical ceiling of a few thousand residues that can be processed on a single GPU. Here we present NVIDIA BioNeMo Fold-CP, a context parallelism framework that overcomes this barrier by distributing the inference and training pipelines of co-folding models across multiple GPUs. We use the Boltz models as open source reference architectures and implement custom multidimensional primitives that efficiently parallelize both the dense triangular updates and the irregular, data-dependent pattern of window-batched local attention. Our approach achieves efficient memory scaling; for an N-token input distributed across P GPUs, per-device memory scales as $O(N^2/P)$, enabling the structure prediction of assemblies exceeding 30,000 residues on 64 NVIDIA B300 GPUs. We demonstrate the scientific utility of this approach through successful developer use cases: Fold-CP enabled the scoring of over 90% of Comprehensive Resource of Mammalian protein complexes (CORUM) database, as well as folding of disease-relevant PI4KA lipid kinase complex bound to an intrinsically disordered region without cropping. By providing a scalable pathway for modeling massive systems with full global context, Fold-CP represents a significant step toward the realization of a virtual cell.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes