QMLGMNDec 13, 2016

Neuro-symbolic representation learning on biological knowledge graphs

arXiv:1612.04256v1149 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of applying machine learning to structured biological knowledge, potentially enabling broader use of Semantic Web-based knowledge bases in biology, though it appears incremental as it builds on existing graph learning and neuro-symbolic paradigms.

The authors tackled the problem of feature learning on biological knowledge graphs by developing a neuro-symbolic method that combines symbolic logic with neural networks to generate node embeddings, demonstrating performance that matches or outperforms traditional manually crafted feature-based approaches in tasks like function prediction and disease gene identification.

Motivation: Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. In the past years, feature learning methods that are applicable to graph-structured data are becoming available, but have not yet widely been applied and evaluated on structured biological knowledge. Results: We develop a novel method for feature learning on biological knowledge graphs. Our method combines symbolic methods, in particular knowledge representation using symbolic logic and automated reasoning, with neural networks to generate embeddings of nodes that encode for related information within knowledge graphs. Through the use of symbolic logic, these embeddings contain both explicit and implicit information. We apply these embeddings to the prediction of edges in the knowledge graph representing problems of function prediction, finding candidate genes of diseases, protein-protein interactions, or drug target relations, and demonstrate performance that matches and sometimes outperforms traditional approaches based on manually crafted features. Our method can be applied to any biological knowledge graph, and will thereby open up the increasing amount of Semantic Web based knowledge bases in biology to use in machine learning and data analytics. Availability and Implementation: https://github.com/bio-ontology-research-group/walking-rdf-and-owl Contact: robert.hoehndorf@kaust.edu.sa

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes