LGDec 5, 2024

Boundary-Guided Learning for Gene Expression Prediction in Spatial Transcriptomics

arXiv:2412.04072v27 citationsh-index: 7BIBM
Originality Incremental advance
AI Analysis

This work addresses a domain-specific challenge in bioinformatics by enhancing gene expression prediction from pathological images, though it appears incremental as it builds on prior deep learning approaches with a novel focus on boundary features.

The paper tackled the problem of predicting gene expression from whole slide images in spatial transcriptomics by proposing BG-TRIPLEX, a framework that uses boundary information to guide learning, and it achieved improved performance over existing methods as measured by Pearson Correlation Coefficient on three public datasets.

Spatial transcriptomics (ST) has emerged as an advanced technology that provides spatial context to gene expression. Recently, deep learning-based methods have shown the capability to predict gene expression from WSI data using ST data. Existing approaches typically extract features from images and the neighboring regions using pretrained models, and then develop methods to fuse this information to generate the final output. However, these methods often fail to account for the cellular structure similarity, cellular density and the interactions within the microenvironment. In this paper, we propose a framework named BG-TRIPLEX, which leverages boundary information extracted from pathological images as guiding features to enhance gene expression prediction from WSIs. Specifically, our model consists of three branches: the spot, in-context and global branches. In the spot and in-context branches, boundary information, including edge and nuclei characteristics, is extracted using pretrained models. These boundary features guide the learning of cellular morphology and the characteristics of microenvironment through Multi-Head Cross-Attention. Finally, these features are integrated with global features to predict the final output. Extensive experiments were conducted on three public ST datasets. The results demonstrate that our BG-TRIPLEX consistently outperforms existing methods in terms of Pearson Correlation Coefficient (PCC). This method highlights the crucial role of boundary features in understanding the complex interactions between WSI and gene expression, offering a promising direction for future research.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes