AICLMay 30, 2025

Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning

arXiv:2505.24478v19 citationsh-index: 2
Originality Incremental advance
AI Analysis

This work addresses the underexplored issue of hyperparameter optimization in modular KG-LLM systems, which is incremental but important for improving retrieval-augmented generation in complex reasoning tasks.

The paper tackles the problem of systematically optimizing hyperparameters in systems that integrate Knowledge Graphs with Large Language Models for complex reasoning, showing that targeted tuning yields consistent but variable performance gains across three multi-hop QA benchmarks.

Integrating Large Language Models (LLMs) with Knowledge Graphs (KGs) results in complex systems with numerous hyperparameters that directly affect performance. While such systems are increasingly common in retrieval-augmented generation, the role of systematic hyperparameter optimization remains underexplored. In this paper, we study this problem in the context of Cognee, a modular framework for end-to-end KG construction and retrieval. Using three multi-hop QA benchmarks (HotPotQA, TwoWikiMultiHop, and MuSiQue) we optimize parameters related to chunking, graph construction, retrieval, and prompting. Each configuration is scored using established metrics (exact match, F1, and DeepEval's LLM-based correctness metric). Our results demonstrate that meaningful gains can be achieved through targeted tuning. While the gains are consistent, they are not uniform, with performance varying across datasets and metrics. This variability highlights both the value of tuning and the limitations of standard evaluation measures. While demonstrating the immediate potential of hyperparameter tuning, we argue that future progress will depend not only on architectural advances but also on clearer frameworks for optimization and evaluation in complex, modular systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes