CLApr 2, 2024

Self-Improvement Programming for Temporal Knowledge Graph Question Answering

ByteDance
arXiv:2404.01720v185 citationsh-index: 32LREC
Originality Incremental advance
AI Analysis

This work addresses the challenge of understanding complex temporal constraints in questions for TKGQA, representing an incremental advance by combining semantic parsing with LLM-based self-improvement.

The paper tackles the problem of answering questions with temporal constraints over Temporal Knowledge Graphs by introducing Prog-TQA, a self-improvement programming method that uses LLMs to generate and refine program drafts, achieving superior performance on benchmark datasets with notable Hits@1 metric improvements.

Temporal Knowledge Graph Question Answering (TKGQA) aims to answer questions with temporal intent over Temporal Knowledge Graphs (TKGs). The core challenge of this task lies in understanding the complex semantic information regarding multiple types of time constraints (e.g., before, first) in questions. Existing end-to-end methods implicitly model the time constraints by learning time-aware embeddings of questions and candidate answers, which is far from understanding the question comprehensively. Motivated by semantic-parsing-based approaches that explicitly model constraints in questions by generating logical forms with symbolic operators, we design fundamental temporal operators for time constraints and introduce a novel self-improvement Programming method for TKGQA (Prog-TQA). Specifically, Prog-TQA leverages the in-context learning ability of Large Language Models (LLMs) to understand the combinatory time constraints in the questions and generate corresponding program drafts with a few examples given. Then, it aligns these drafts to TKGs with the linking module and subsequently executes them to generate the answers. To enhance the ability to understand questions, Prog-TQA is further equipped with a self-improvement strategy to effectively bootstrap LLMs using high-quality self-generated drafts. Extensive experiments demonstrate the superiority of the proposed Prog-TQA on MultiTQ and CronQuestions datasets, especially in the Hits@1 metric.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes