AI LODec 19, 2025

Eidoku: A Neuro-Symbolic Verification Gate for LLM Reasoning via Structural Constraint Satisfaction

arXiv:2512.20664v1

Originality Highly original

AI Analysis

This work addresses the issue of hallucination in LLMs for users relying on reliable reasoning, though it is incremental as it focuses on a specific class of hallucinations in controlled settings.

The paper tackles the problem of LLMs producing hallucinated statements that are structurally inconsistent but statistically probable, by introducing a neuro-symbolic verification gate that models verification as a constraint satisfaction problem based on structural violation cost. The result is a method that deterministically rejects 'smooth falsehoods' in a controlled diagnostic dataset, addressing a limitation of probability-based verifiers.

Large Language Models (LLMs) frequently produce hallucinated statements that are assigned high likelihood by the model itself, exposing a fundamental limitation of probability-based verification. This suggests that hallucination is often not a low-confidence phenomenon, but a failure of structural consistency. In this work, we reformulate the verification of LLM reasoning as a Constraint Satisfaction Problem (CSP) operating independently of the generation likelihood. Rather than optimizing for statistical plausibility, we model verification as a feasibility check based on structural violation cost -- the computational cost required to embed a candidate reasoning step into the contextual graph structure. We define a total cost function composed of three proxies: (i) graph connectivity (structural), (ii) feature space consistency (geometric), and (iii) logical entailment (symbolic). Crucially, verification is performed via a lightweight System-2 gate, Eidoku, which rejects candidates exceeding a context-calibrated cost threshold. The threshold is not learned but is derived from the intrinsic statistics of the context, avoiding ad hoc heuristics. We demonstrate that this approach successfully rejects ``smooth falsehoods'' -- statements that are highly probable yet structurally disconnected -- that probability-based verifiers are principally incapable of detecting. Our experiments on a controlled diagnostic dataset show that explicitly enforcing structural constraints allows for the deterministic rejection of this specific class of hallucinations, serving as a neuro-symbolic sanity check for generative reasoning.

View on arXiv PDF

Similar