CRAIMay 20, 2025

Can Large Language Models Really Recognize Your Name?

arXiv:2505.14549v18 citationsh-index: 16
Originality Incremental advance
AI Analysis

This exposes critical risks for users relying on LLM-based privacy solutions, as it challenges the assumption that LLMs can reliably protect sensitive data, highlighting an incremental but important gap in current methods.

The paper reveals that modern large language models (LLMs) systematically fail to detect ambiguous human names in text, with recall dropping by 20-40% compared to recognizable names, and these names are four times more likely to be ignored in privacy-preserving summaries.

Large language models (LLMs) are increasingly being used to protect sensitive user data. However, current LLM-based privacy solutions assume that these models can reliably detect personally identifiable information (PII), particularly named entities. In this paper, we challenge that assumption by revealing systematic failures in LLM-based privacy tasks. Specifically, we show that modern LLMs regularly overlook human names even in short text snippets due to ambiguous contexts, which cause the names to be misinterpreted or mishandled. We propose AMBENCH, a benchmark dataset of seemingly ambiguous human names, leveraging the name regularity bias phenomenon, embedded within concise text snippets along with benign prompt injections. Our experiments on modern LLMs tasked to detect PII as well as specialized tools show that recall of ambiguous names drops by 20--40% compared to more recognizable names. Furthermore, ambiguous human names are four times more likely to be ignored in supposedly privacy-preserving summaries generated by LLMs when benign prompt injections are present. These findings highlight the underexplored risks of relying solely on LLMs to safeguard user privacy and underscore the need for a more systematic investigation into their privacy failure modes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes