LGCRDec 16, 2021

Correlation inference attacks against machine learning models

arXiv:2112.08806v47 citations
Originality Incremental advance
AI Analysis

This work addresses privacy concerns for users of machine learning models by revealing vulnerabilities that could enable weaker adversaries to infer sensitive training data attributes.

The paper investigates correlation inference attacks to determine if machine learning models leak information about correlations between input variables in their training datasets, demonstrating that both logistic regression and multilayer perceptron models are vulnerable to such attacks.

Despite machine learning models being widely used today, the relationship between a model and its training dataset is not well understood. We explore correlation inference attacks, whether and when a model leaks information about the correlations between the input variables of its training dataset. We first propose a model-less attack, where an adversary exploits the spherical parametrization of correlation matrices alone to make an informed guess. Second, we propose a model-based attack, where an adversary exploits black-box model access to infer the correlations using minimal and realistic assumptions. Third, we evaluate our attacks against logistic regression and multilayer perceptron models on three tabular datasets and show the models to leak correlations. We finally show how extracted correlations can be used as building blocks for attribute inference attacks and enable weaker adversaries. Our results raise fundamental questions on what a model does and should remember from its training set.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes