DBCLMay 11

Toward Multi-Database Query Reasoning for Text2Cypher

arXiv:2605.1037318.2
Predicted impact top 17% in DB · last 90 daysOriginality Incremental advance
AI Analysis

For researchers and practitioners building natural language interfaces to graph databases, this work formalizes a more realistic multi-database scenario, but it is a conceptual proposal without empirical validation.

The paper identifies the limitation of existing Text2Cypher systems that assume a single preselected graph database, and proposes a shift to multi-database query reasoning where the system must route across databases, decompose questions, and integrate results. It provides a structured formulation and roadmap for this new setting.

Large language models have significantly improved natural language interfaces to databases by translating user questions into executable queries. In particular, Text2Cypher focuses on generating Cypher queries for graph databases, enabling users to access graph data without query language expertise. Most existing Text2Cypher systems assume a single preselected graph database, where queries are generated over a known schema. However, real-world systems are often distributed across multiple independent graph databases organized by domain or system boundaries, where relevant information may span multiple sources. To address this limitation, we propose a shift from single-database query generation to multi-database query reasoning. Instead of assuming a fixed execution context, the system must reason about (i) relevant databases, (ii) how to decompose a question across them, and (iii) how to integrate partial results. We formalize this setting through a three-phase roadmap: database routing, multi-database decomposition, and heterogeneous query reasoning across database types and query languages. This work provides a structured formulation of multi-database reasoning for Text2Cypher and identifies challenges in source selection, query decomposition, and result integration, aiming to support more realistic and scalable natural language interfaces to graph databases.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes