AICLJun 10, 2022

Graph-in-Graph Network for Automatic Gene Ontology Description Generation

Oxford
arXiv:2206.05311v27 citationsh-index: 35
Originality Incremental advance
AI Analysis

This addresses a gap in biomedical knowledge bases by enabling automated description generation for new GO terms, which is an incremental advancement over existing gene-term association prediction tasks.

The paper tackles the novel task of automatically generating descriptions for Gene Ontology terms by proposing a Graph-in-Graph network that leverages structural information, resulting in up to 34.7%, 14.5%, and 39.1% relative improvements in BLEU, ROUGE-L, and METEOR scores across seven sequence-to-sequence models.

Gene Ontology (GO) is the primary gene function knowledge base that enables computational tasks in biomedicine. The basic element of GO is a term, which includes a set of genes with the same function. Existing research efforts of GO mainly focus on predicting gene term associations. Other tasks, such as generating descriptions of new terms, are rarely pursued. In this paper, we propose a novel task: GO term description generation. This task aims to automatically generate a sentence that describes the function of a GO term belonging to one of the three categories, i.e., molecular function, biological process, and cellular component. To address this task, we propose a Graph-in-Graph network that can efficiently leverage the structural information of GO. The proposed network introduces a two-layer graph: the first layer is a graph of GO terms where each node is also a graph (gene graph). Such a Graph-in-Graph network can derive the biological functions of GO terms and generate proper descriptions. To validate the effectiveness of the proposed network, we build three large-scale benchmark datasets. By incorporating the proposed Graph-in-Graph network, the performances of seven different sequence-to-sequence models can be substantially boosted across all evaluation metrics, with up to 34.7%, 14.5%, and 39.1% relative improvements in BLEU, ROUGE-L, and METEOR, respectively.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes