LGGNMLMay 6, 2019

A joint model of unpaired data from scRNA-seq and spatial transcriptomics for imputing missing gene expression measurements

arXiv:1905.02269v1144 citations
AI Analysis

This addresses a bottleneck in spatial transcriptomics for biologists by enabling more comprehensive gene expression analysis without prior gene selection, though it is incremental as it builds on existing domain adaptation work.

The authors tackled the problem of imputing missing gene expression measurements in spatial transcriptomic data using unpaired single-cell RNA-seq data, proposing gimVI, a deep generative model that outperformed alternative methods like Seurat Anchors, Liger, and CORAL on real datasets.

Spatial studies of transcriptome provide biologists with gene expression maps of heterogeneous and complex tissues. However, most experimental protocols for spatial transcriptomics suffer from the need to select beforehand a small fraction of genes to be quantified over the entire transcriptome. Standard single-cell RNA sequencing (scRNA-seq) is more prevalent, easier to implement and can in principle capture any gene but cannot recover the spatial location of the cells. In this manuscript, we focus on the problem of imputation of missing genes in spatial transcriptomic data based on (unpaired) standard scRNA-seq data from the same biological tissue. Building upon domain adaptation work, we propose gimVI, a deep generative model for the integration of spatial transcriptomic data and scRNA-seq data that can be used to impute missing genes. After describing our generative model and an inference procedure for it, we compare gimVI to alternative methods from computational biology or domain adaptation on real datasets and outperform Seurat Anchors, Liger and CORAL to impute held-out genes.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes