GNAIFeb 19, 2025

Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics

arXiv:2502.13785v27 citationsh-index: 1Has Code
Originality Highly original
AI Analysis

This work addresses mRNA vaccine optimization for the pharmaceutical industry, representing a novel method for a known bottleneck.

The authors tackled the challenge of optimizing mRNA sequences for therapeutic properties by developing Helix-mRNA, a hybrid model that processes full sequences including UTRs, achieving 6x longer sequence processing with 10% of the parameters of existing models.

mRNA-based vaccines have become a major focus in the pharmaceutical industry. The coding sequence as well as the Untranslated Regions (UTRs) of an mRNA can strongly influence translation efficiency, stability, degradation, and other factors that collectively determine a vaccine's effectiveness. However, optimizing mRNA sequences for those properties remains a complex challenge. Existing deep learning models often focus solely on coding region optimization, overlooking the UTRs. We present Helix-mRNA, a structured state-space-based and attention hybrid model to address these challenges. In addition to a first pre-training, a second pre-training stage allows us to specialise the model with high-quality data. We employ single nucleotide tokenization of mRNA sequences with codon separation, ensuring prior biological and structural information from the original mRNA sequence is not lost. Our model, Helix-mRNA, outperforms existing methods in analysing both UTRs and coding region properties. It can process sequences 6x longer than current approaches while using only 10% of the parameters of existing foundation models. Its predictive capabilities extend to all mRNA regions. We open-source the model (https://github.com/helicalAI/helical) and model weights (https://huggingface.co/helical-ai/helix-mRNA).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes