CLJun 2, 2025

Schema as Parameterized Tools for Universal Information Extraction

arXiv:2506.01276v1h-index: 12
Originality Incremental advance
AI Analysis

This addresses a bottleneck in UIE for researchers and practitioners by improving adaptability across tasks, though it appears incremental as it builds on existing tool-calling paradigms.

The paper tackles the lack of adaptability in universal information extraction (UIE) when selecting between predefined schemas and on-the-fly generation, proposing a unified framework called Schema as Parameterized Tools (SPT) that achieves comparable extraction performance to leading systems with significantly fewer trainable parameters.

Universal information extraction (UIE) primarily employs an extractive generation approach with large language models (LLMs), typically outputting structured information based on predefined schemas such as JSON or tables. UIE suffers from a lack of adaptability when selecting between predefined schemas and on-the-fly schema generation within the in-context learning paradigm, especially when there are numerous schemas to choose from. In this paper, we propose a unified adaptive text-to-structure generation framework, called Schema as Parameterized Tools (SPT), which reimagines the tool-calling capability of LLMs by treating predefined schemas as parameterized tools for tool selection and parameter filling. Specifically, our SPT method can be applied to unify closed, open, and on-demand IE tasks by adopting Schema Retrieval by fetching the relevant schemas from a predefined pool, Schema Filling by extracting information and filling slots as with tool parameters, or Schema Generation by synthesizing new schemas with uncovered cases. Experiments show that the SPT method can handle four distinct IE tasks adaptively, delivering robust schema retrieval and selection performance. SPT also achieves comparable extraction performance to LoRA baselines and current leading UIE systems with significantly fewer trainable parameters.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes