LG AIOct 31, 2025

EL-MIA: Quantifying Membership Inference Risks of Sensitive Entities in LLMs

Ali Satvaty, Suzan Verberne, Fatih Turkmen

arXiv:2511.00192v11 citationsh-index: 8

Originality Incremental advance

AI Analysis

This addresses privacy risks for sensitive data in LLMs, but is incremental as it builds on existing MIA techniques.

The paper tackled the problem of quantifying membership inference risks for sensitive entities in LLMs, proposing the EL-MIA framework and benchmark, and found that existing methods are limited for entity-level inference while susceptibility can be outlined with straightforward methods.

Membership inference attacks (MIA) aim to infer whether a particular data point is part of the training dataset of a model. In this paper, we propose a new task in the context of LLM privacy: entity-level discovery of membership risk focused on sensitive information (PII, credit card numbers, etc). Existing methods for MIA can detect the presence of entire prompts or documents in the LLM training data, but they fail to capture risks at a finer granularity. We propose the ``EL-MIA'' framework for auditing entity-level membership risks in LLMs. We construct a benchmark dataset for the evaluation of MIA methods on this task. Using this benchmark, we conduct a systematic comparison of existing MIA techniques as well as two newly proposed methods. We provide a comprehensive analysis of the results, trying to explain the relation of the entity level MIA susceptability with the model scale, training epochs, and other surface level factors. Our findings reveal that existing MIA methods are limited when it comes to entity-level membership inference of the sensitive attributes, while this susceptibility can be outlined with relatively straightforward methods, highlighting the need for stronger adversaries to stress test the provided threat model.

View on arXiv PDF

Similar