LGQMDec 16, 2025

Accelerating MHC-II Epitope Discovery via Multi-Scale Prediction in Antigen Presentation

arXiv:2512.14011v1h-index: 4
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of limited and non-standardized data for MHC-II epitope prediction in immunotherapy, though it is incremental as it builds on existing datasets and methods.

The authors tackled the challenge of predicting MHC-II epitopes by creating a well-curated dataset from public sources and formulating three machine learning tasks to model the antigen presentation pathway, resulting in a resource that standardizes and extends existing data for computational immunotherapy.

Antigenic epitope presented by major histocompatibility complex II (MHC-II) proteins plays an essential role in immunotherapy. However, compared to the more widely studied MHC-I in computational immunotherapy, the study of MHC-II antigenic epitope poses significantly more challenges due to its complex binding specificity and ambiguous motif patterns. Consequently, existing datasets for MHC-II interactions are smaller and less standardized than those available for MHC-I. To address these challenges, we present a well-curated dataset derived from the Immune Epitope Database (IEDB) and other public sources. It not only extends and standardizes existing peptide-MHC-II datasets, but also introduces a novel antigen-MHC-II dataset with richer biological context. Leveraging this dataset, we formulate three major machine learning (ML) tasks of peptide binding, peptide presentation, and antigen presentation, which progressively capture the broader biological processes within the MHC-II antigen presentation pathway. We further employ a multi-scale evaluation framework to benchmark existing models, along with a comprehensive analysis over various modeling designs to this problem with a modular framework. Overall, this work serves as a valuable resource for advancing computational immunotherapy, providing a foundation for future research in ML guided epitope discovery and predictive modeling of immune responses.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes