CL AI LGSep 9, 2024

Logically Consistent Language Models via Neuro-Symbolic Integration

Diego Calanzone, Stefano Teso, Antonio Vergari

arXiv:2409.13724v111.930 citationsh-index: 23

Originality Incremental advance

AI Analysis

This work addresses the issue of unreliable reasoning in LLMs for applications requiring factual accuracy and logical coherence, representing an incremental improvement over existing methods.

The authors tackled the problem of logical inconsistency in large language models by introducing a neuro-symbolic reasoning loss that improves self-consistency with external facts and rules, even with limited fine-tuning data, and enhances extrapolation to unseen datasets.

Large language models (LLMs) are a promising venue for natural language understanding and generation. However, current LLMs are far from reliable: they are prone to generating non-factual information and, more crucially, to contradicting themselves when prompted to reason about relations between entities of the world. These problems are currently addressed with large scale fine-tuning or by delegating reasoning to external tools. In this work, we strive for a middle ground and introduce a loss based on neuro-symbolic reasoning that teaches an LLM to be logically consistent with an external set of facts and rules and improves self-consistency even when the LLM is fine-tuned on a limited set of facts. Our approach also allows to easily combine multiple logical constraints at once in a principled way, delivering LLMs that are more consistent w.r.t. all constraints and improve over several baselines w.r.t. a given constraint. Moreover, our method allows LLMs to extrapolate to unseen but semantically similar factual knowledge, represented in unseen datasets, more systematically.

View on arXiv PDF

Similar