LGJun 6, 2023

Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings

arXiv:2306.04064v23 citationsh-index: 37
AI Analysis

This addresses a critical gap in adversarial robustness for tabular data in high-risk domains like fraud detection and medical diagnosis, offering a novel transferable solution.

The paper tackles adversarial robustness for tabular data with categorical features by developing a method to train robust deep networks and transfer this robustness to other classifiers like boosted trees or random forests via universal robust embeddings, showing it outperforms existing techniques.

Research on adversarial robustness is primarily focused on image and text data. Yet, many scenarios in which lack of robustness can result in serious risks, such as fraud detection, medical diagnosis, or recommender systems often do not rely on images or text but instead on tabular data. Adversarial robustness in tabular data poses two serious challenges. First, tabular datasets often contain categorical features, and therefore cannot be tackled directly with existing optimization procedures. Second, in the tabular domain, algorithms that are not based on deep networks are widely used and offer great performance, but algorithms to enhance robustness are tailored to neural networks (e.g. adversarial training). In this paper, we tackle both challenges. We present a method that allows us to train adversarially robust deep networks for tabular data and to transfer this robustness to other classifiers via universal robust embeddings tailored to categorical data. These embeddings, created using a bilevel alternating minimization framework, can be transferred to boosted trees or random forests making them robust without the need for adversarial training while preserving their high accuracy on tabular data. We show that our methods outperform existing techniques within a practical threat model suitable for tabular data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes