QM AI LG COMP-PHApr 4, 2025

JanusDDG: A Thermodynamics-Compliant Model for Sequence-Based Protein Stability via Two-Fronts Multi-Head Attention

Guido Barducci, Ivan Rossi, Francesco Codicè, Cesare Rollo, Valeria Repetto, Corrado Pancotti, Virginia Iannibelli, Tiziana Sanavia, Piero Fariselli

arXiv:2504.03278v21.2h-index: 14Has Code

Originality Highly original

AI Analysis

This work addresses the need for accurate protein stability prediction from sequence alone, which is crucial for protein design and disease mutation analysis, representing a strong specific gain in computational biology.

The paper tackled the problem of predicting how residue mutations affect protein stability (ΔΔG) by introducing JanusDDG, a deep learning model that uses protein language model embeddings and a bidirectional cross-attention transformer, achieving state-of-the-art performance that matches or exceeds structure-based methods for single and multiple mutations.

Understanding how residue variations affect protein stability is crucial for designing functional proteins and deciphering the molecular mechanisms underlying disease-related mutations. Recent advances in protein language models (PLMs) have revolutionized computational protein analysis, enabling, among other things, more accurate predictions of mutational effects. In this work, we introduce JanusDDG, a deep learning framework that leverages PLM-derived embeddings and a bidirectional cross-attention transformer architecture to predict $ΔΔG$ of single and multiple-residue mutations while simultaneously being constrained to respect fundamental thermodynamic properties, such as antisymmetry and transitivity. Unlike conventional self-attention, JanusDDG computes queries (Q) and values (V) as the difference between wild-type and mutant embeddings, while keys (K) alternate between the two. This cross-interleaved attention mechanism enables the model to capture mutation-induced perturbations while preserving essential contextual information. Experimental results show that JanusDDG achieves state-of-the-art performance in predicting $ΔΔG$ from sequence alone, matching or exceeding the accuracy of structure-based methods for both single and multiple mutations. Code Availability:https://github.com/compbiomed-unito/JanusDDG

View on arXiv PDF Code

Similar