AI | Dec 2, 2024

LLMs4Life: Large Language Models for Ontology Learning in Life Sciences

Nadeen Fathallah, Steffen Staab, Alsayed Algergawy

arXiv:2412.02035v1 | 14.717 citations | h-index: 15 | Has Code | EKAW

Originality: Incremental advance

AI Analysis

This work addresses ontology learning for life science researchers, but it is incremental: it extends the existing NeOn-GPT pipeline with prompt engineering techniques and ontology reuse.

The paper tackles the challenge of using Large Language Models (LLMs) for ontology learning in life sciences, where existing models struggle with hierarchical depth and domain adaptation, and demonstrates viability through a case study on the AquaDiva ontology.

Ontology learning in complex domains, such as life sciences, poses significant challenges for current Large Language Models (LLMs). Existing LLMs struggle to generate ontologies with multiple hierarchical levels, rich interconnections, and comprehensive class coverage due to constraints on the number of tokens they can generate and inadequate domain adaptation. To address these issues, we extend the NeOn-GPT pipeline for ontology learning using LLMs with advanced prompt engineering techniques and ontology reuse to enhance the generated ontologies' domain-specific reasoning and structural depth. Our work evaluates the capabilities of LLMs in ontology learning in the context of highly specialized and complex domains such as life science domains. To assess the logical consistency, completeness, and scalability of the generated ontologies, we use the AquaDiva ontology developed and used in the collaborative research center AquaDiva as a case study. Our evaluation shows the viability of LLMs for ontology learning in specialized domains, providing solutions to longstanding limitations in model performance and scalability.

