CLFeb 20, 2023

ChatIE: Zero-Shot Information Extraction via Chatting with ChatGPT

arXiv:2302.10205v2490 citationsh-index: 32
AI Analysis

This work addresses the problem of reducing data labeling effort for information extraction, though it is incremental as it builds on existing prompt-based methods with LLMs.

The authors tackled zero-shot information extraction by transforming it into a multi-turn question-answering problem using ChatGPT, achieving impressive performance that surpassed some full-shot models on datasets like NYT11-HRL.

Zero-shot information extraction (IE) aims to build IE systems from the unannotated text. It is challenging due to involving little human intervention. Challenging but worthwhile, zero-shot IE reduces the time and effort that data labeling takes. Recent efforts on large language models (LLMs, e.g., GPT-3, ChatGPT) show promising performance on zero-shot settings, thus inspiring us to explore prompt-based methods. In this work, we ask whether strong IE models can be constructed by directly prompting LLMs. Specifically, we transform the zero-shot IE task into a multi-turn question-answering problem with a two-stage framework (ChatIE). With the power of ChatGPT, we extensively evaluate our framework on three IE tasks: entity-relation triple extract, named entity recognition, and event extraction. Empirical results on six datasets across two languages show that ChatIE achieves impressive performance and even surpasses some full-shot models on several datasets (e.g., NYT11-HRL). We believe that our work could shed light on building IE models with limited resources.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes