LGBMJun 3, 2025

Protein Inverse Folding From Structure Feedback

arXiv:2506.03028v15 citationsh-index: 10
Originality Incremental advance
AI Analysis

This work addresses the challenge of designing amino acid sequences for desired protein structures, which is crucial for biotechnological applications, by proposing an incremental method that leverages preference optimization.

The paper tackled the protein inverse folding problem by introducing a Direct Preference Optimization (DPO) approach to fine-tune an inverse folding model using feedback from a protein folding model, resulting in improved sequence recovery and an average TM-Score increase from 0.77 to 0.81 on the CATH 4.2 test set, with iterative applications yielding a 79.5% gain.

The inverse folding problem, aiming to design amino acid sequences that fold into desired three-dimensional structures, is pivotal for various biotechnological applications. Here, we introduce a novel approach leveraging Direct Preference Optimization (DPO) to fine-tune an inverse folding model using feedback from a protein folding model. Given a target protein structure, we begin by sampling candidate sequences from the inverse-folding model, then predict the three-dimensional structure of each sequence with the folding model to generate pairwise structural-preference labels. These labels are used to fine-tune the inverse-folding model under the DPO objective. Our results on the CATH 4.2 test set demonstrate that DPO fine-tuning not only improves sequence recovery of baseline models but also leads to a significant improvement in average TM-Score from 0.77 to 0.81, indicating enhanced structure similarity. Furthermore, iterative application of our DPO-based method on challenging protein structures yields substantial gains, with an average TM-Score increase of 79.5\% with regard to the baseline model. This work establishes a promising direction for enhancing protein sequence design ability from structure feedback by effectively utilizing preference optimization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes