LG AIOct 9, 2025

Inverse-Free Wilson Loops for Transformers: A Practical Diagnostic for Invariance and Order Sensitivity

arXiv:2510.08648v14.11 citationsh-index: 1

Originality Incremental advance

AI Analysis

This addresses reliability issues in deployed AI systems for practitioners, offering a practical tool to detect and mitigate invariance failures without retraining.

The paper tackles the problem of large language models changing answers under harmless edits like reordering, which violates intended invariances and breaks continuous integration, by presenting WILSON, a post-hoc diagnostic suite that converts loop and reordering checks into system signals, enabling actions such as guarding RAG against order effects and catching fine-tuning regressions.

Large language models can change answers under harmless edits that matter in practice: RAG outputs flip when passages are reordered, fine-tuning erodes invariances learned at pretraining, debate or chain-of-thought prompts take path-dependent routes, and compiler fusion or reordering perturbs logits near decision boundaries. These failures violate intended invariances, break continuous integration, and force teams to trade safety for speed. The effects are small yet distributed across layers and positions, sensitive to context length and evaluation order, and costly to repair with retraining or formal verification. We present WILSON, a minimal post-hoc diagnostic suite that converts simple loop and reordering checks on internal representations into system signals. WILSON combines an inverse-free curvature map over positions and layers, computed with JVPs and Hutchinson probes, with activation-level commutators that flag reorder risk. Signals are cheap to compute, model-agnostic for standard Transformers, and exported as thresholds and CSV artifacts for orchestrators. This enables concrete actions: guard RAG against order effects, catch fine-tuning regressions, stabilize debate pathways and long multi-turn contexts, and gate fusions or reorders in deployment. In short, WILSON helps anticipate failures and approve safe optimizations so reliability and throughput can improve together without changing model architecture or training.

View on arXiv PDF

Similar