LGAIJul 4, 2024

Adversarial Robustness of VAEs across Intersectional Subgroups

arXiv:2407.03864v21 citationsh-index: 26
Originality Incremental advance
AI Analysis

It addresses robustness disparities in VAEs for demographic subgroups, which is an incremental contribution to fairness in machine learning.

This study evaluated the adversarial robustness of Variational Autoencoders (VAEs) across demographic subgroups, finding that robustness disparities exist, with subgroups like older women being particularly vulnerable to misclassification due to adversarial perturbations.

Despite advancements in Autoencoders (AEs) for tasks like dimensionality reduction, representation learning and data generation, they remain vulnerable to adversarial attacks. Variational Autoencoders (VAEs), with their probabilistic approach to disentangling latent spaces, show stronger resistance to such perturbations compared to deterministic AEs; however, their resilience against adversarial inputs is still a concern. This study evaluates the robustness of VAEs against non-targeted adversarial attacks by optimizing minimal sample-specific perturbations to cause maximal damage across diverse demographic subgroups (combinations of age and gender). We investigate two questions: whether there are robustness disparities among subgroups, and what factors contribute to these disparities, such as data scarcity and representation entanglement. Our findings reveal that robustness disparities exist but are not always correlated with the size of the subgroup. By using downstream gender and age classifiers and examining latent embeddings, we highlight the vulnerability of subgroups like older women, who are prone to misclassification due to adversarial perturbations pushing their representations toward those of other subgroups.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes