CLJan 15

SocraticKG: Knowledge Graph Construction via QA-Driven Fact Extraction

arXiv:2601.10003v1h-index: 1
Originality Incremental advance
AI Analysis

This addresses the problem of building coherent knowledge graphs from unstructured text for applications in knowledge representation and reasoning, representing an incremental improvement over existing LLM-based methods.

The paper tackled the trade-off between factual coverage and relational fragmentation in knowledge graph construction from text by proposing SocraticKG, which uses QA-driven fact extraction as an intermediate step, achieving superior factual retention and structural cohesion on the MINE benchmark.

Constructing Knowledge Graphs (KGs) from unstructured text provides a structured framework for knowledge representation and reasoning, yet current LLM-based approaches struggle with a fundamental trade-off: factual coverage often leads to relational fragmentation, while premature consolidation causes information loss. To address this, we propose SocraticKG, an automated KG construction method that introduces question-answer pairs as a structured intermediate representation to systematically unfold document-level semantics prior to triple extraction. By employing 5W1H-guided QA expansion, SocraticKG captures contextual dependencies and implicit relational links typically lost in direct KG extraction pipelines, providing explicit grounding in the source document that helps mitigate implicit reasoning errors. Evaluation on the MINE benchmark demonstrates that our approach effectively addresses the coverage-connectivity trade-off, achieving superior factual retention while maintaining high structural cohesion even as extracted knowledge volume substantially expands. These results highlight that QA-mediated semantic scaffolding plays a critical role in structuring semantics prior to KG extraction, enabling more coherent and reliable graph construction in subsequent stages.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes