LGAICLJun 24, 2024

How Data Inter-connectivity Shapes LLMs Unlearning: A Structural Unlearning Perspective

arXiv:2406.16810v22 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of unlearning in LLMs for real-world applications where data is interconnected, though it is incremental as it builds on existing unlearning methods by focusing on structural aspects.

The paper tackles the problem of unlearning knowledge from large language models (LLMs) by investigating how data inter-connectivity affects unlearning effectiveness, showing that unlearning difficulty increases with greater inter-connectivity and knowledge graph density.

While unlearning knowledge from large language models (LLMs) is receiving increasing attention, one important aspect remains unexplored. Existing approaches and benchmarks assume data points to-be-forgotten are independent, ignoring their inter-connectivity - a fundamental characteristic of real-world data structures. In this paper, we propose PISTOL, a method for compiling structural datasets. PISTOL leverages the inherently structured nature of contractual relationships, offering several key benefits. First, it enables insights into the impact of structural data on unlearning effectiveness. Second, it provides precise and concise ground truths for clearer evaluation. Third, its attribute generation does not require input from pre-trained LLMs, mitigating confounding risks. Leveraging datasets synthesized using PISTOL, we demonstrate how data inter-connectivity impacts LLM unlearning. Specifically, (a) in both the pre-trained and fine-tuned models, unlearning difficulty increases as data inter-connectivity grows, (b) there is a positive correlation between the density of the knowledge graph and unlearning difficulty, and (c) when the to-be-forgotten data is skewed towards one domain, balancing retaining performance across all domains is challenging.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes