CLApr 4, 2024

Okay, Let's Do This! Modeling Event Coreference with Generated Rationales and Knowledge Distillation

arXiv:2404.03196v131 citationsh-index: 6Has CodeNAACL
Originality Incremental advance
AI Analysis

This addresses cross-document event coreference for NLP applications, offering an incremental improvement through knowledge distillation without extra annotation.

The paper tackles event coreference resolution by using generated rationales from large language models to supervise smaller models, achieving state-of-the-art B3 F1 scores on ECB+ and GVC corpora and setting a new baseline on AIDA Phase 1.

In NLP, Event Coreference Resolution (ECR) is the task of connecting event clusters that refer to the same underlying real-life event, usually via neural systems. In this work, we investigate using abductive free-text rationales (FTRs) generated by modern autoregressive LLMs as distant supervision of smaller student models for cross-document coreference (CDCR) of events. We implement novel rationale-oriented event clustering and knowledge distillation methods for event coreference scoring that leverage enriched information from the FTRs for improved CDCR without additional annotation or expensive document clustering. Our model using coreference specific knowledge distillation achieves SOTA B3 F1 on the ECB+ and GVC corpora and we establish a new baseline on the AIDA Phase 1 corpus. Our code can be found at https://github.com/csu-signal/llama_cdcr

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes