LGCRSYJun 2, 2024

Amalgam: A Framework for Obfuscated Neural Network Training on the Cloud

arXiv:2406.03405v22 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses privacy risks for users training proprietary models on cloud platforms, but it is an incremental improvement over existing obfuscation techniques.

The paper tackles the problem of exposing proprietary neural network models and datasets to cloud service providers during training by introducing Amalgam, a framework that obfuscates models and datasets with calibrated noise to preserve privacy, and it shows modest overheads without affecting model accuracy.

Training a proprietary Neural Network (NN) model with a proprietary dataset on the cloud comes at the risk of exposing the model architecture and the dataset to the cloud service provider. To tackle this problem, in this paper, we present an NN obfuscation framework, called Amalgam, to train NN models in a privacy-preserving manner in existing cloud-based environments. Amalgam achieves that by augmenting NN models and the datasets to be used for training with well-calibrated noise to "hide" both the original model architectures and training datasets from the cloud. After training, Amalgam extracts the original models from the augmented models and returns them to users. Our evaluation results with different computer vision and natural language processing models and datasets demonstrate that Amalgam: (i) introduces modest overheads into the training process without impacting its correctness, and (ii) does not affect the model's accuracy. The prototype implementation is available at: https://github.com/SifatTaj/amalgam

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes