CLAILGSep 23, 2021

Zero-Shot Information Extraction as a Unified Text-to-Triple Translation

arXiv:2109.11171v1667 citations
Originality Incremental advance
AI Analysis

This work addresses the need for task-agnostic models in information extraction, reducing reliance on labeled datasets, though it is incremental in building on pre-trained language models.

The authors tackled the problem of information extraction by unifying multiple tasks into a text-to-triple translation framework, enabling zero-shot transfer to tasks like open information extraction and relation classification, often achieving competitive performance with supervised methods without task-specific training, such as significantly outperforming supervised F1 scores in open information extraction.

We cast a suite of information extraction tasks into a text-to-triple translation framework. Instead of solving each task relying on task-specific datasets and models, we formalize the task as a translation between task-specific input text and output triples. By taking the task-specific input, we enable a task-agnostic translation by leveraging the latent knowledge that a pre-trained language model has about the task. We further demonstrate that a simple pre-training task of predicting which relational information corresponds to which input text is an effective way to produce task-specific outputs. This enables the zero-shot transfer of our framework to downstream tasks. We study the zero-shot performance of this framework on open information extraction (OIE2016, NYT, WEB, PENN), relation classification (FewRel and TACRED), and factual probe (Google-RE and T-REx). The model transfers non-trivially to most tasks and is often competitive with a fully supervised method without the need for any task-specific training. For instance, we significantly outperform the F1 score of the supervised open information extraction without needing to use its training set.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes