CLDec 18, 2024

Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models' Character Understanding Evaluation

arXiv:2412.14368v59 citationsh-index: 3
Originality Incremental advance
AI Analysis

This addresses data contamination in benchmarks for AI researchers, exposing that current evaluations may measure memorization over true understanding, which is incremental as it builds on existing concerns about memorization in LLMs.

The paper tackles the problem of large language models relying on verbatim memorization rather than genuine reasoning in character understanding tasks, showing that their method reduces memorization-driven accuracy from 96% to 72% and causes up to an 18% drop in performance across tasks.

Recently, Large Language Models (LLMs) have shown impressive performance in character understanding tasks, such as analyzing the roles, personalities, and relationships of fictional characters. However, the extensive pre-training corpora used by LLMs raise concerns that they may rely on memorizing popular fictional works rather than genuinely understanding and reasoning about them. In this work, we argue that 'gist memory'-capturing essential meaning - should be the primary mechanism for character understanding tasks, as opposed to 'verbatim memory' - exact match of a string. We introduce a simple yet effective method to mitigate mechanized memorization in character understanding evaluations while preserving the essential implicit cues needed for comprehension and reasoning. Our approach reduces memorization-driven performance on popular fictional works from 96% accuracy to 72% and results in up to an 18% drop in accuracy across various character understanding tasks. These findings underscore the issue of data contamination in existing benchmarks, which often measure memorization rather than true character understanding.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes