LGGNNCOct 24, 2023

Compressed representation of brain genetic transcription

arXiv:2310.16113v3h-index: 41
Originality Synthesis-oriented
AI Analysis

This work provides a reference standard for representing brain transcription patterns, benefiting neuroscientists and researchers in computational biology by improving data analysis and interpretation.

The study tackled the challenge of compressing high-dimensional brain gene expression data into a navigable space by comparing linear and non-linear methods, finding that deep auto-encoders outperformed others in reconstruction fidelity, anatomical coherence, and predictive utility.

The architecture of the brain is too complex to be intuitively surveyable without the use of compressed representations that project its variation into a compact, navigable space. The task is especially challenging with high-dimensional data, such as gene expression, where the joint complexity of anatomical and transcriptional patterns demands maximum compression. Established practice is to use standard principal component analysis (PCA), whose computational felicity is offset by limited expressivity, especially at great compression ratios. Employing whole-brain, voxel-wise Allen Brain Atlas transcription data, here we systematically compare compressed representations based on the most widely supported linear and non-linear methods-PCA, kernel PCA, non-negative matrix factorization (NMF), t-stochastic neighbour embedding (t-SNE), uniform manifold approximation and projection (UMAP), and deep auto-encoding-quantifying reconstruction fidelity, anatomical coherence, and predictive utility with respect to signalling, microstructural, and metabolic targets. We show that deep auto-encoders yield superior representations across all metrics of performance and target domains, supporting their use as the reference standard for representing transcription patterns in the human brain.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes