MLGNQMAPOct 25, 2013

A feasible roadmap for unsupervised deconvolution of two-source mixed gene expressions

arXiv:1310.7033v12 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of studying individual cell populations in heterogeneous tissues for researchers in genomics and bioinformatics, offering a cost-effective alternative to experimental methods.

The paper tackles the problem of tissue heterogeneity in gene expression analysis by proposing an unsupervised method to deconvolute two-source mixed expressions, estimating both proportions and cell-specific profiles without prior knowledge, and demonstrates its performance on simulation and real data.

Tissue heterogeneity is a major confounding factor in studying individual populations that cannot be resolved directly by global profiling. Experimental solutions to mitigate tissue heterogeneity are expensive, time consuming, inapplicable to existing data, and may alter the original gene expression patterns. Here we ask whether it is possible to deconvolute two-source mixed expressions (estimating both proportions and cell-specific profiles) from two or more heterogeneous samples without requiring any prior knowledge. Supported by a well-grounded mathematical framework, we argue that both constituent proportions and cell-specific expressions can be estimated in a completely unsupervised mode when cell-specific marker genes exist, which do not have to be known a priori, for each of constituent cell types. We demonstrate the performance of unsupervised deconvolution on both simulation and real gene expression data, together with perspective discussions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes