StyleID: Identity Disentanglement for Anonymizing Faces
This addresses privacy concerns in AI for applications like autonomous cars and surveillance, offering an incremental improvement over existing methods.
The paper tackles the problem of anonymizing facial images in datasets by proposing StyleID, a framework that manipulates identity features in GAN latent space to protect privacy while preserving other facial characteristics, achieving state-of-the-art performance with tunable privacy and low computational complexity.
Privacy of machine learning models is one of the remaining challenges that hinder the broad adoption of Artificial Intelligent (AI). This paper considers this problem in the context of image datasets containing faces. Anonymization of such datasets is becoming increasingly important due to their central role in the training of autonomous cars, for example, and the vast amount of data generated by surveillance systems. While most prior work de-identifies facial images by modifying identity features in pixel space, we instead project the image onto the latent space of a Generative Adversarial Network (GAN) model, find the features that provide the biggest identity disentanglement, and then manipulate these features in latent space, pixel space, or both. The main contribution of the paper is the design of a feature-preserving anonymization framework, StyleID, which protects the individuals' identity, while preserving as many characteristics of the original faces in the image dataset as possible. As part of the contribution, we present a novel disentanglement metric, three complementing disentanglement methods, and new insights into identity disentanglement. StyleID provides tunable privacy, has low computational complexity, and is shown to outperform current state-of-the-art solutions.