CLMay 23, 2025

Model Editing with Graph-Based External Memory

Yash Kumar Atri, Ahmed Alaa, Thomas Hartvigsen

arXiv:2505.18343v14.91 citationsh-index: 8ACL

Originality Highly original

AI Analysis

This addresses the issue of dynamic updates for LLMs to reduce errors and forgetting, offering a novel method for precise edits, though it is incremental in improving existing editing techniques.

The paper tackled the problem of hallucinations and outdated knowledge in large language models by proposing HYPE, a framework using hyperbolic geometry and graph neural networks for model editing, which significantly improved edit stability, factual accuracy, and multi-hop reasoning in experiments on datasets like CounterFact and MQuAKE with models such as GPT-J and GPT2-XL.

Large language models (LLMs) have revolutionized natural language processing, yet their practical utility is often limited by persistent issues of hallucinations and outdated parametric knowledge. Although post-training model editing offers a pathway for dynamic updates, existing methods frequently suffer from overfitting and catastrophic forgetting. To tackle these challenges, we propose a novel framework that leverages hyperbolic geometry and graph neural networks for precise and stable model edits. We introduce HYPE (HYperbolic Parameter Editing), which comprises three key components: (i) Hyperbolic Graph Construction, which uses Poincaré embeddings to represent knowledge triples in hyperbolic space, preserving hierarchical relationships and preventing unintended side effects by ensuring that edits to parent concepts do not inadvertently affect child concepts; (ii) Möbius-Transformed Updates, which apply hyperbolic addition to propagate edits while maintaining structural consistency within the hyperbolic manifold, unlike conventional Euclidean updates that distort relational distances; and (iii) Dual Stabilization, which combines gradient masking and periodic GNN parameter resetting to prevent catastrophic forgetting by focusing updates on critical parameters and preserving long-term knowledge. Experiments on CounterFact, CounterFact+, and MQuAKE with GPT-J and GPT2-XL demonstrate that HYPE significantly enhances edit stability, factual accuracy, and multi-hop reasoning.

View on arXiv PDF

Similar