CL AIAug 6, 2024

Enhancing Complex Causality Extraction via Improved Subtask Interaction and Knowledge Fusion

Jinglong Gao, Chen Lu, Xiao Ding, Zhongyang Li, Ting Liu, Bing Qin

arXiv:2408.03079v13.45 citationsh-index: 16

Originality Incremental advance

AI Analysis

This work solves the problem of extracting causal event pairs from texts for natural language processing applications, representing an incremental advancement by integrating existing methods into a unified framework.

The paper tackled the problem of Event Causality Extraction (ECE) by addressing three key challenges: complex causality extraction, subtask interaction, and knowledge fusion, resulting in a unified framework that achieved state-of-the-art performance with at least a 30% F1-score improvement over ChatGPT on benchmark datasets.

Event Causality Extraction (ECE) aims at extracting causal event pairs from texts. Despite ChatGPT's recent success, fine-tuning small models remains the best approach for the ECE task. However, existing fine-tuning based ECE methods cannot address all three key challenges in ECE simultaneously: 1) Complex Causality Extraction, where multiple causal-effect pairs occur within a single sentence; 2) Subtask~ Interaction, which involves modeling the mutual dependence between the two subtasks of ECE, i.e., extracting events and identifying the causal relationship between extracted events; and 3) Knowledge Fusion, which requires effectively fusing the knowledge in two modalities, i.e., the expressive pretrained language models and the structured knowledge graphs. In this paper, we propose a unified ECE framework (UniCE to address all three issues in ECE simultaneously. Specifically, we design a subtask interaction mechanism to enable mutual interaction between the two ECE subtasks. Besides, we design a knowledge fusion mechanism to fuse knowledge in the two modalities. Furthermore, we employ separate decoders for each subtask to facilitate complex causality extraction. Experiments on three benchmark datasets demonstrate that our method achieves state-of-the-art performance and outperforms ChatGPT with a margin of at least 30% F1-score. More importantly, our model can also be used to effectively improve the ECE performance of ChatGPT via in-context learning.

View on arXiv PDF

Similar