Genomic Interpreter: A Hierarchical Genomic Deep Neural Network with 1D Shifted Window Transformer
This addresses the need for interpretable machine-learning models in genomics to extract insights from large-scale data, though it appears incremental as it builds on existing Transformer-based methods.
The paper tackles the problem of predicting genomic assays like chromatin accessibility and gene expression by introducing Genomic Interpreter, a novel architecture that outperforms state-of-the-art models on a dataset of 38,171 DNA segments.
Given the increasing volume and quality of genomics data, extracting new insights requires interpretable machine-learning models. This work presents Genomic Interpreter: a novel architecture for genomic assay prediction. This model outperforms the state-of-the-art models for genomic assay prediction tasks. Our model can identify hierarchical dependencies in genomic sites. This is achieved through the integration of 1D-Swin, a novel Transformer-based block designed by us for modelling long-range hierarchical data. Evaluated on a dataset containing 38,171 DNA segments of 17K base pairs, Genomic Interpreter demonstrates superior performance in chromatin accessibility and gene expression prediction and unmasks the underlying `syntax' of gene regulation.