LGAIJun 11, 2025

EnerBridge-DPO: Energy-Guided Protein Inverse Folding with Markov Bridges and Direct Preference Optimization

arXiv:2506.09496v12 citationsh-index: 2
Originality Incremental advance
AI Analysis

This work addresses a key challenge in protein design for researchers and practitioners by integrating energy guidance into inverse folding, though it appears incremental as it builds on existing methods like Markov Bridges and DPO.

The paper tackles the problem of designing protein sequences with optimal energetic stability in protein inverse folding, where current methods focus on sequence recovery but neglect energy. The result is EnerBridge-DPO, a model that generates sequences with lower energy while maintaining comparable sequence recovery rates to state-of-the-art models and accurately predicts ΔΔG values.

Designing protein sequences with optimal energetic stability is a key challenge in protein inverse folding, as current deep learning methods are primarily trained by maximizing sequence recovery rates, often neglecting the energy of the generated sequences. This work aims to overcome this limitation by developing a model that directly generates low-energy, stable protein sequences. We propose EnerBridge-DPO, a novel inverse folding framework focused on generating low-energy, high-stability protein sequences. Our core innovation lies in: First, integrating Markov Bridges with Direct Preference Optimization (DPO), where energy-based preferences are used to fine-tune the Markov Bridge model. The Markov Bridge initiates optimization from an information-rich prior sequence, providing DPO with a pool of structurally plausible sequence candidates. Second, an explicit energy constraint loss is introduced, which enhances the energy-driven nature of DPO based on prior sequences, enabling the model to effectively learn energy representations from a wealth of prior knowledge and directly predict sequence energy values, thereby capturing quantitative features of the energy landscape. Our evaluations demonstrate that EnerBridge-DPO can design protein complex sequences with lower energy while maintaining sequence recovery rates comparable to state-of-the-art models, and accurately predicts $ΔΔG$ values between various sequences.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes