CLMar 1, 2024

Formulation Comparison for Timeline Construction using LLMs

arXiv:2403.00990v13 citationsh-index: 4Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses timeline construction for natural language processing applications, but it is incremental as it focuses on dataset creation and formulation comparison.

The authors tackled the problem of timeline construction by developing a new dataset, TimeSET, with document-level order annotation to address missing temporal information in prior datasets, and found that NLI formulation with Flan-T5 performed strongly while the tasks remained challenging for few-shot LLMs.

Constructing a timeline requires identifying the chronological order of events in an article. In prior timeline construction datasets, temporal orders are typically annotated by either event-to-time anchoring or event-to-event pairwise ordering, both of which suffer from missing temporal information. To mitigate the issue, we develop a new evaluation dataset, TimeSET, consisting of single-document timelines with document-level order annotation. TimeSET features saliency-based event selection and partial ordering, which enable a practical annotation workload. Aiming to build better automatic timeline construction systems, we propose a novel evaluation framework to compare multiple task formulations with TimeSET by prompting open LLMs, i.e., Llama 2 and Flan-T5. Considering that identifying temporal orders of events is a core subtask in timeline construction, we further benchmark open LLMs on existing event temporal ordering datasets to gain a robust understanding of their capabilities. Our experiments show that (1) NLI formulation with Flan-T5 demonstrates a strong performance among others, while (2) timeline construction and event temporal ordering are still challenging tasks for few-shot LLMs. Our code and data are available at https://github.com/kimihiroh/timeset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes