CLAIMar 28, 2025

FRASE: Structured Representations for Generalizable SPARQL Query Generation

arXiv:2503.22144v1h-index: 2
Originality Incremental advance
AI Analysis

This addresses the generalization issue in SPARQL query generation for Knowledge Base systems, though it is incremental as it builds on existing methods with a novel enhancement.

The paper tackles the problem of translating natural language questions into SPARQL queries for Knowledge Base querying, where existing models fail to generalize due to template-based datasets, and introduces FRASE with frame semantic role labeling to improve performance, showing consistent gains in generalization scenarios like unseen templates and reformulated questions.

Translating natural language questions into SPARQL queries enables Knowledge Base querying for factual and up-to-date responses. However, existing datasets for this task are predominantly template-based, leading models to learn superficial mappings between question and query templates rather than developing true generalization capabilities. As a result, models struggle when encountering naturally phrased, template-free questions. This paper introduces FRASE (FRAme-based Semantic Enhancement), a novel approach that leverages Frame Semantic Role Labeling (FSRL) to address this limitation. We also present LC-QuAD 3.0, a new dataset derived from LC-QuAD 2.0, in which each question is enriched using FRASE through frame detection and the mapping of frame-elements to their argument. We evaluate the impact of this approach through extensive experiments on recent large language models (LLMs) under different fine-tuning configurations. Our results demonstrate that integrating frame-based structured representations consistently improves SPARQL generation performance, particularly in challenging generalization scenarios when test questions feature unseen templates (unknown template splits) and when they are all naturally phrased (reformulated questions).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes