CVAIDec 14, 2024

Medical Manifestation-Aware De-Identification

arXiv:2412.10804v17 citationsh-index: 7Has CodeAAAI
Originality Incremental advance
AI Analysis

This addresses the lack of large-scale patient face datasets for medical de-identification, enabling research in this domain while protecting patient privacy.

The paper tackles the problem of face de-identification in medical scenes by releasing MeMa, a dataset of over 40,000 photo-realistic patient faces, and proposes a baseline approach that substantially outperforms previous methods.

Face de-identification (DeID) has been widely studied for common scenes, but remains under-researched for medical scenes, mostly due to the lack of large-scale patient face datasets. In this paper, we release MeMa, consisting of over 40,000 photo-realistic patient faces. MeMa is re-generated from massive real patient photos. By carefully modulating the generation and data-filtering procedures, MeMa avoids breaching real patient privacy, while ensuring rich and plausible medical manifestations. We recruit expert clinicians to annotate MeMa with both coarse- and fine-grained labels, building the first medical-scene DeID benchmark. Additionally, we propose a baseline approach for this new medical-aware DeID task, by integrating data-driven medical semantic priors into the DeID procedure. Despite its conciseness and simplicity, our approach substantially outperforms previous ones. Dataset is available at https://github.com/tianyuan168326/MeMa-Pytorch.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes