LG CR MMAug 29, 2023

On the Steganographic Capacity of Selected Learning Models

Rishit Agrawal, Kelvin Jou, Tanush Obili, Daksh Parikh, Samarth Prajapati, Yash Seth, Charan Sridhar, Nathan Zhang, Mark Stamp

arXiv:2308.15502v13.82 citationsh-index: 36

Originality Incremental advance

AI Analysis

This work addresses the problem of steganographic capacity in learning models for security researchers, revealing vulnerabilities that could be exploited in attacks, though it is incremental as it builds on prior research about hiding information in models.

The paper investigates how many low-order bits of trained parameters in various machine learning models can be overwritten without degrading accuracy, finding that a majority of bits can be altered, with capacities ranging from 7.04 KB for Linear Regression to 44.74 MB for InceptionV3.

Machine learning and deep learning models are potential vectors for various attack scenarios. For example, previous research has shown that malware can be hidden in deep learning models. Hiding information in a learning model can be viewed as a form of steganography. In this research, we consider the general question of the steganographic capacity of learning models. Specifically, for a wide range of models, we determine the number of low-order bits of the trained parameters that can be overwritten, without adversely affecting model performance. For each model considered, we graph the accuracy as a function of the number of low-order bits that have been overwritten, and for selected models, we also analyze the steganographic capacity of individual layers. The models that we test include the classic machine learning techniques of Linear Regression (LR) and Support Vector Machine (SVM); the popular general deep learning models of Multilayer Perceptron (MLP) and Convolutional Neural Network (CNN); the highly-successful Recurrent Neural Network (RNN) architecture of Long Short-Term Memory (LSTM); the pre-trained transfer learning-based models VGG16, DenseNet121, InceptionV3, and Xception; and, finally, an Auxiliary Classifier Generative Adversarial Network (ACGAN). In all cases, we find that a majority of the bits of each trained parameter can be overwritten before the accuracy degrades. Of the models tested, the steganographic capacity ranges from 7.04 KB for our LR experiments, to 44.74 MB for InceptionV3. We discuss the implications of our results and consider possible avenues for further research.

View on arXiv PDF

Similar