LG BM · May 4, 2025

NbBench: Benchmarking Language Models for Comprehensive Nanobody Tasks

arXiv:2505.02022v24.14 citations · h-index: 6 · Has Code · Machine Learning: Science and Technology

Originality: Synthesis-oriented

AI Analysis

This work addresses a gap in biomolecular AI by providing a standardized benchmark for researchers in therapeutics and diagnostics, though the contribution is incremental because it focuses on evaluation rather than new methods.

The authors address the lack of a unified benchmark for nanobody-specific modeling by introducing NbBench, a comprehensive benchmark suite spanning eight tasks across nine datasets. They find that antibody language models perform well on antigen-related tasks, that all models struggle with regression tasks such as thermostability and affinity, and that no single model consistently outperforms the others.

Nanobodies -- single-domain antibody fragments derived from camelid heavy-chain-only antibodies -- exhibit unique advantages such as compact size, high stability, and strong binding affinity, making them valuable tools in therapeutics and diagnostics. While recent advances in pretrained protein and antibody language models (PPLMs and PALMs) have greatly enhanced biomolecular understanding, nanobody-specific modeling remains underexplored and lacks a unified benchmark. To address this gap, we introduce NbBench, the first comprehensive benchmark suite for nanobody representation learning. Spanning eight biologically meaningful tasks across nine curated datasets, NbBench encompasses structure annotation, binding prediction, and developability assessment. We systematically evaluate eleven representative models -- including general-purpose protein LMs, antibody-specific LMs, and nanobody-specific LMs -- in a frozen setting. Our analysis reveals that antibody language models excel in antigen-related tasks, while performance on regression tasks such as thermostability and affinity remains challenging across all models. Notably, no single model consistently outperforms others across all tasks. By standardizing datasets, task definitions, and evaluation protocols, NbBench offers a reproducible foundation for assessing and advancing nanobody modeling.
