MLCRLGSTMay 21, 2025

A Linear Approach to Data Poisoning

arXiv:2505.15175v22 citationsh-index: 9
Originality Incremental advance
AI Analysis

This work addresses security vulnerabilities in ML systems for practitioners, though it appears incremental as it builds on existing theoretical foundations.

The paper tackled the problem of detecting data poisoning attacks in machine learning models by analyzing the Hessian's spectral signatures, showing that this theory applies to linear regression and extends to deep networks like CNNs and transformers, with experiments validating the approach.

We investigate the theoretical foundations of data poisoning attacks in machine learning models. Our analysis reveals that the Hessian with respect to the input serves as a diagnostic tool for detecting poisoning, exhibiting spectral signatures that characterize compromised datasets. We use random matrix theory (RMT) to develop a theory for the impact of poisoning proportion and regularisation on attack efficacy in linear regression. Through QR stepwise regression, we study the spectral signatures of the Hessian in multi-output regression. We perform experiments on deep networks to show experimentally that this theory extends to modern convolutional and transformer networks under the cross-entropy loss. Based on these insights we develop preliminary algorithms to determine if a network has been poisoned and remedies which do not require further training.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes