GNLGApr 26, 2023

UNADON: Transformer-based model to predict genome-wide chromosome spatial position

arXiv:2304.13230v23 citationsh-index: 6
Originality Highly original
AI Analysis

This work addresses the challenge of understanding chromatin spatial organization for genomics and nuclear biology, representing an incremental advance with a novel method for a known bottleneck.

The researchers tackled the problem of predicting genome-wide chromosome spatial positioning relative to nuclear bodies by developing UNADON, a transformer-based model that uses sequence and epigenomic features, achieving high accuracy in four cell lines and in an unseen cell type.

The spatial positioning of chromosomes relative to functional nuclear bodies is intertwined with genome functions such as transcription. However, the sequence patterns and epigenomic features that collectively influence chromatin spatial positioning in a genome-wide manner are not well understood. Here, we develop a new transformer-based deep learning model called UNADON, which predicts the genome-wide cytological distance to a specific type of nuclear body, as measured by TSA-seq, using both sequence features and epigenomic signals. Evaluations of UNADON in four cell lines (K562, H1, HFFc6, HCT116) show high accuracy in predicting chromatin spatial positioning to nuclear bodies when trained on a single cell line. UNADON also performed well in an unseen cell type. Importantly, we reveal potential sequence and epigenomic factors that affect large-scale chromatin compartmentalization to nuclear bodies. Together, UNADON provides new insights into the principles between sequence features and large-scale chromatin spatial localization, which has important implications for understanding nuclear structure and function.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes