MLLGCOJan 17, 2021

Multi-view Data Visualisation via Manifold Learning

arXiv:2101.06763v314 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of integrating multiple data types for visualization and clustering, particularly in domains like multi-omics single-cell analysis, but it is incremental as it builds on existing manifold learning techniques.

The paper tackled the problem of visualizing multi-view data by extending manifold learning methods like t-SNE, LLE, and ISOMAP, resulting in more comprehensible projections and accurate cluster identification when combined with K-means clustering, with multi-SNE performing best on real and synthetic data.

Non-linear dimensionality reduction can be performed by \textit{manifold learning} approaches, such as Stochastic Neighbour Embedding (SNE), Locally Linear Embedding (LLE) and Isometric Feature Mapping (ISOMAP). These methods aim to produce two or three latent embeddings, primarily to visualise the data in intelligible representations. This manuscript proposes extensions of Student's t-distributed SNE (t-SNE), LLE and ISOMAP, for dimensionality reduction and visualisation of multi-view data. Multi-view data refers to multiple types of data generated from the same samples. The proposed multi-view approaches provide more comprehensible projections of the samples compared to the ones obtained by visualising each data-view separately. Commonly visualisation is used for identifying underlying patterns within the samples. By incorporating the obtained low-dimensional embeddings from the multi-view manifold approaches into the K-means clustering algorithm, it is shown that clusters of the samples are accurately identified. Through the analysis of real and synthetic data the proposed multi-SNE approach is found to have the best performance. We further illustrate the applicability of the multi-SNE approach for the analysis of multi-omics single-cell data, where the aim is to visualise and identify cell heterogeneity and cell types in biological tissues relevant to health and disease.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes