CL AISep 8, 2022

IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model

Martin Fajcik, Muskaan Singh, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Pavel Smrz

arXiv:2209.03891v223.9291 citationsh-index: 31Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses event causality identification in news media, which is incremental as it applies an existing method to a specific shared task with limited data.

The paper tackled the problem of automatically detecting cause-effect-signal spans in news sentences by using a T5 pre-trained autoregressive language model with iterative conditioning on previously predicted triplets, achieving second place in a competition with competitive performance despite training on only 160 samples.

In this paper, we describe our shared task submissions for Subtask 2 in CASE-2022, Event Causality Identification with Casual News Corpus. The challenge focused on the automatic detection of all cause-effect-signal spans present in the sentence from news-media. We detect cause-effect-signal spans in a sentence using T5 -- a pre-trained autoregressive language model. We iteratively identify all cause-effect-signal span triplets, always conditioning the prediction of the next triplet on the previously predicted ones. To predict the triplet itself, we consider different causal relationships such as cause$\rightarrow$effect$\rightarrow$signal. Each triplet component is generated via a language model conditioned on the sentence, the previous parts of the current triplet, and previously predicted triplets. Despite training on an extremely small dataset of 160 samples, our approach achieved competitive performance, being placed second in the competition. Furthermore, we show that assuming either cause$\rightarrow$effect or effect$\rightarrow$cause order achieves similar results.

View on arXiv PDF Code

Similar