CLHCLGOct 11, 2022

Rethinking the Event Coding Pipeline with Prompt Entailment

ETH Zurich
arXiv:2210.05257v2261 citationsh-index: 13
Originality Incremental advance
AI Analysis

This addresses the challenge for low-resource humanitarian aid organizations in managing unstructured event data, offering a more accessible method for event coding.

The authors tackled the problem of classifying political events from news for crisis monitoring by proposing PR-ENT, a flexible and resource-efficient event coding approach that uses prompt entailment with pre-trained language models, achieving competitive accuracy without requiring large annotated datasets.

For monitoring crises, political events are extracted from the news. The large amount of unstructured full-text event descriptions makes a case-by-case analysis unmanageable, particularly for low-resource humanitarian aid organizations. This creates a demand to classify events into event types, a task referred to as event coding. Typically, domain experts craft an event type ontology, annotators label a large dataset and technical experts develop a supervised coding system. In this work, we propose PR-ENT, a new event coding approach that is more flexible and resource-efficient, while maintaining competitive accuracy: first, we extend an event description such as "Military injured two civilians'' by a template, e.g. "People were [Z]" and prompt a pre-trained (cloze) language model to fill the slot Z. Second, we select answer candidates Z* = {"injured'', "hurt"...} by treating the event description as premise and the filled templates as hypothesis in a textual entailment task. This allows domain experts to draft the codebook directly as labeled prompts and interpretable answer candidates. This human-in-the-loop process is guided by our interactive codebook design tool. We evaluate PR-ENT in several robustness checks: perturbing the event description and prompt template, restricting the vocabulary and removing contextual information.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes