CLOct 14, 2025

Information Extraction from Conversation Transcripts: Neuro-Symbolic vs. LLM

Alice Saebom Kwak, Maria Alexeeva, Gus Hahn-Powell, Keith Alcock, Kevin McLaughlin, Doug McCorkle, Gabe McNunn, Mihai Surdeanu

arXiv:2510.12023v12.7h-index: 10

Originality Synthesis-oriented

AI Analysis

It addresses the trade-offs in deploying NLP systems for real-world applications, highlighting performance, efficiency, and control issues, but is incremental as it compares existing methods on new data.

This paper compared a neuro-symbolic and an LLM-based information extraction system in agriculture, finding that the LLM-based system outperformed the neuro-symbolic one with F1 scores of 69.4 vs. 52.7 for total extraction and 63.0 vs. 47.2 for core details.

The current trend in information extraction (IE) is to rely extensively on large language models, effectively discarding decades of experience in building symbolic or statistical IE systems. This paper compares a neuro-symbolic (NS) and an LLM-based IE system in the agricultural domain, evaluating them on nine interviews across pork, dairy, and crop subdomains. The LLM-based system outperforms the NS one (F1 total: 69.4 vs. 52.7; core: 63.0 vs. 47.2), where total includes all extracted information and core focuses on essential details. However, each system has trade-offs: the NS approach offers faster runtime, greater control, and high accuracy in context-free tasks but lacks generalizability, struggles with contextual nuances, and requires significant resources to develop and maintain. The LLM-based system achieves higher performance, faster deployment, and easier maintenance but has slower runtime, limited control, model dependency and hallucination risks. Our findings highlight the "hidden cost" of deploying NLP systems in real-world applications, emphasizing the need to balance performance, efficiency, and control.

View on arXiv PDF

Similar