BIO-PHAILGNov 13, 2025

Completion of partial structures using Patterson maps with the CrysFormer machine learning model

arXiv:2511.10440v12 citationsh-index: 11Acta Crystallographica Section D: Structural Biology
Originality Incremental advance
AI Analysis

This work addresses the problem of incomplete protein structures in structural biology by combining traditional crystallography with ML, though it is incremental as it builds on existing methods like AlphaFold.

The paper tackled the challenge of protein structure determination by integrating experimental X-ray crystallographic data with machine learning, using Patterson maps and partial structure templates to predict electron density maps, which improved phase accuracy and completed missing regions in structures.

Protein structure determination has long been one of the primary challenges of structural biology, to which deep machine learning (ML)-based approaches have increasingly been applied. However, these ML models generally do not incorporate the experimental measurements directly, such as X-ray crystallographic diffraction data. To this end, we explore an approach that more tightly couples these traditional crystallographic and recent ML-based methods, by training a hybrid 3-d vision transformer and convolutional network on inputs from both domains. We make use of two distinct input constructs / Patterson maps, which are directly obtainable from crystallographic data, and ``partial structure'' template maps derived from predicted structures deposited in the AlphaFold Protein Structure Database with subsequently omitted residues. With these, we predict electron density maps that are then post-processed into atomic models through standard crystallographic refinement processes. Introducing an initial dataset of small protein fragments taken from Protein Data Bank entries and placing them in hypothetical crystal settings, we demonstrate that our method is effective at both improving the phases of the crystallographic structure factors and completing the regions missing from partial structure templates, as well as improving the agreement of the electron density maps with the ground truth atomic structures.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes