CL, AI, IR
Jun 1, 2023

TimelineQA: A Benchmark for Question Answering over Timelines

arXiv:2306.01069v1
230 citations
h-index: 100
Originality: Synthesis-oriented
AI Analysis

This paper addresses the problem of querying lifelogs for personal assistants, but its contribution is incremental: it applies existing QA techniques to a new benchmark rather than proposing a new method.

The authors introduced TimelineQA, a benchmark for question answering over lifelogs, and found that for atomic queries, an extractive QA system outperformed a retrieval-augmented system, while for multi-hop queries, a table QA technique performed best when ground truth episodes were available.

Lifelogs are descriptions of experiences that a person had during their life. Lifelogs are created by fusing data from a multitude of digital services, such as online photos, maps, shopping and content streaming services. Question answering over lifelogs can offer personal assistants a critical resource when they try to provide advice in context. However, obtaining answers to questions over lifelogs is beyond the current state of the art of question answering techniques for a variety of reasons, the most pronounced of which is that lifelogs combine free text with some degree of structure such as temporal and geographical information. We create and publicly release TimelineQA, a benchmark for accelerating progress on querying lifelogs. TimelineQA generates lifelogs of imaginary people. The episodes in the lifelog range from major life episodes such as high school graduation to those that occur on a daily basis such as going for a run. We describe a set of experiments on TimelineQA with several state-of-the-art QA models. Our experiments reveal that for atomic queries, an extractive QA system significantly outperforms a state-of-the-art retrieval-augmented QA system. For multi-hop queries involving aggregates, we show that the best result is obtained with a state-of-the-art table QA technique, assuming the ground truth set of episodes for deriving the answer is available.
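The distinction between atomic and multi-hop aggregate queries can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Episode` fields, the sample entries, and the helper names `atomic_when` and `count_in_month` are hypothetical and do not reflect TimelineQA's actual data format or any of the QA models evaluated in the paper.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Episode:
    """One lifelog entry: free text plus a little structure (hypothetical schema)."""
    day: date
    activity: str
    text: str


# A toy lifelog mixing a major life episode with daily ones (invented data).
lifelog = [
    Episode(date(2010, 6, 12), "graduation", "I graduated from high school."),
    Episode(date(2023, 3, 4), "run", "I went for a 5 km run in the park."),
    Episode(date(2023, 3, 6), "run", "I went for a 3 km run by the river."),
    Episode(date(2023, 4, 2), "run", "I went for a 10 km run downtown."),
]


def atomic_when(log, activity):
    """Atomic query: the answer is contained in a single episode."""
    for episode in log:
        if episode.activity == activity:
            return episode.day
    return None


def count_in_month(log, activity, year, month):
    """Multi-hop aggregate query: the answer combines several episodes."""
    return sum(
        1
        for episode in log
        if episode.activity == activity
        and episode.day.year == year
        and episode.day.month == month
    )


graduation_day = atomic_when(lifelog, "graduation")
runs_in_march = count_in_month(lifelog, "run", 2023, 3)
print(graduation_day, runs_in_march)
```

In this toy setting the aggregate is trivial because the relevant episodes are already structured and known. The abstract's caveat about ground truth episodes points at the harder step in practice: finding the right set of episodes before any aggregation can be applied.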

Code Implementations: 1 repo