CL AIAug 12, 2025

TEN: Table Explicitization, Neurosymbolically

Nikita Mehrotra, Aayush Kumar, Sumit Gulwani, Arjun Radhakrishna, Ashish Tiwari

arXiv:2508.09324v11 citationsh-index: 65

Originality Incremental advance

AI Analysis

This addresses the challenge of reliable table extraction for data processing applications, offering a hybrid solution to reduce errors in a domain-specific task.

The paper tackles the problem of extracting tabular data from semistructured text without consistent delimiters by proposing TEN, a neurosymbolic approach that combines LLM prompting with symbolic checking and self-debugging, resulting in significantly higher exact match accuracy and reduced hallucination rates compared to neural baselines, with user studies showing a mean accuracy score of 5.0 vs. 4.3 and over 60% preference.

We present a neurosymbolic approach, TEN, for extracting tabular data from semistructured input text. This task is particularly challenging for text input that does not use special delimiters consistently to separate columns and rows. Purely neural approaches perform poorly due to hallucinations and their inability to enforce hard constraints. TEN uses Structural Decomposition prompting - a specialized chain-of-thought prompting approach - on a large language model (LLM) to generate an initial table, and thereafter uses a symbolic checker to evaluate not only the well-formedness of that table, but also detect cases of hallucinations or forgetting. The output of the symbolic checker is processed by a critique-LLM to generate guidance for fixing the table, which is presented to the original LLM in a self-debug loop. Our extensive experiments demonstrate that TEN significantly outperforms purely neural baselines across multiple datasets and metrics, achieving significantly higher exact match accuracy and substantially reduced hallucination rates. A 21-participant user study further confirms that TEN's tables are rated significantly more accurate (mean score: 5.0 vs 4.3; p = 0.021), and are consistently preferred for ease of verification and correction, with participants favoring our method in over 60% of the cases.

View on arXiv PDF

Similar