CLJul 18, 2024

Transformer-based Single-Cell Language Model: A Survey

arXiv:2407.13205v128 citationsh-index: 10
Originality Synthesis-oriented
AI Analysis

This is an incremental review paper summarizing existing research for computational biology researchers working with single-cell data.

This paper surveys transformer-based models for single-cell data analysis, reviewing their architectures, applications in tasks like batch correction and cell type annotation, and discussing current challenges and future directions.

The transformers have achieved significant accomplishments in the natural language processing as its outstanding parallel processing capabilities and highly flexible attention mechanism. In addition, increasing studies based on transformers have been proposed to model single-cell data. In this review, we attempt to systematically summarize the single-cell language models and applications based on transformers. First, we provide a detailed introduction about the structure and principles of transformers. Then, we review the single-cell language models and large language models for single-cell data analysis. Moreover, we explore the datasets and applications of single-cell language models in downstream tasks such as batch correction, cell clustering, cell type annotation, gene regulatory network inference and perturbation response. Further, we discuss the challenges of single-cell language models and provide promising research directions. We hope this review will serve as an up-to-date reference for researchers interested in the direction of single-cell language models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes