CLFeb 20, 2023

ChatIE: Zero-Shot Information Extraction via Chatting with ChatGPT

Xiang Wei, Xingyu Cui, Ning Cheng, Xiaobin Wang, Xin Zhang, Shen Huang, Pengjun Xie, Jinan Xu, Yufeng Chen, Meishan Zhang, Yong Jiang, Wenjuan Han

arXiv:2302.10205v227.1500 citationsh-index: 32Has Code

Originality Incremental advance

AI Analysis

This work addresses the problem of reducing data labeling effort for information extraction, though it is incremental as it builds on existing prompt-based methods with LLMs.

The authors tackled zero-shot information extraction by transforming it into a multi-turn question-answering problem using ChatGPT, achieving impressive performance that surpassed some full-shot models on datasets like NYT11-HRL.

Zero-shot information extraction (IE) aims to build IE systems from the unannotated text. It is challenging due to involving little human intervention. Challenging but worthwhile, zero-shot IE reduces the time and effort that data labeling takes. Recent efforts on large language models (LLMs, e.g., GPT-3, ChatGPT) show promising performance on zero-shot settings, thus inspiring us to explore prompt-based methods. In this work, we ask whether strong IE models can be constructed by directly prompting LLMs. Specifically, we transform the zero-shot IE task into a multi-turn question-answering problem with a two-stage framework (ChatIE). With the power of ChatGPT, we extensively evaluate our framework on three IE tasks: entity-relation triple extract, named entity recognition, and event extraction. Empirical results on six datasets across two languages show that ChatIE achieves impressive performance and even surpasses some full-shot models on several datasets (e.g., NYT11-HRL). We believe that our work could shed light on building IE models with limited resources.

View on arXiv PDF Code

Similar