LGDec 10, 2025

Exploring Protein Language Model Architecture-Induced Biases for Antibody Comprehension

Mengren, Liu, Yixiang Zhang, Yiming, Zhang

arXiv:2512.09894v14.1

Originality Synthesis-oriented

AI Analysis

This work provides insights for computational antibody design by analyzing model biases, but it is incremental as it compares existing models without introducing new methods.

The study investigated how different protein language model architectures capture antibody-specific biological properties, finding that while all models achieved high classification accuracy, they exhibited distinct biases in capturing features like V gene usage and somatic hypermutation patterns.

Recent advances in protein language models (PLMs) have demonstrated remarkable capabilities in understanding protein sequences. However, the extent to which different model architectures capture antibody-specific biological properties remains unexplored. In this work, we systematically investigate how architectural choices in PLMs influence their ability to comprehend antibody sequence characteristics and functions. We evaluate three state-of-the-art PLMs-AntiBERTa, BioBERT, and ESM2--against a general-purpose language model (GPT-2) baseline on antibody target specificity prediction tasks. Our results demonstrate that while all PLMs achieve high classification accuracy, they exhibit distinct biases in capturing biological features such as V gene usage, somatic hypermutation patterns, and isotype information. Through attention attribution analysis, we show that antibody-specific models like AntiBERTa naturally learn to focus on complementarity-determining regions (CDRs), while general protein models benefit significantly from explicit CDR-focused training strategies. These findings provide insights into the relationship between model architecture and biological feature extraction, offering valuable guidance for future PLM development in computational antibody design.

View on arXiv PDF

Similar