CR AIMar 6, 2024

Do You Trust Your Model? Emerging Malware Threats in the Deep Learning Ecosystem

Dorjan Hitaj, Giulio Pagnotta, Fabio De Gaspari, Sediola Ruko, Briland Hitaj, Luigi V. Mancini, Fernando Perez-Cruz

arXiv:2403.03593v38.512 citationsh-index: 13IEEE Transactions on Dependable and Secure Computing

Originality Highly original

AI Analysis

This work highlights a new supply chain threat for users of pre-trained models, raising awareness in the research and industry communities.

The paper tackles the problem of malware threats in deep learning by introducing MaleficNet 2.0, a technique that embeds self-extracting malware in neural network parameters without degrading model performance, demonstrating its effectiveness in proof-of-concept attacks.

Training high-quality deep learning models is a challenging task due to computational and technical requirements. A growing number of individuals, institutions, and companies increasingly rely on pre-trained, third-party models made available in public repositories. These models are often used directly or integrated in product pipelines with no particular precautions, since they are effectively just data in tensor form and considered safe. In this paper, we raise awareness of a new machine learning supply chain threat targeting neural networks. We introduce MaleficNet 2.0, a novel technique to embed self-extracting, self-executing malware in neural networks. MaleficNet 2.0 uses spread-spectrum channel coding combined with error correction techniques to inject malicious payloads in the parameters of deep neural networks. MaleficNet 2.0 injection technique is stealthy, does not degrade the performance of the model, and is robust against removal techniques. We design our approach to work both in traditional and distributed learning settings such as Federated Learning, and demonstrate that it is effective even when a reduced number of bits is used for the model parameters. Finally, we implement a proof-of-concept self-extracting neural network malware using MaleficNet 2.0, demonstrating the practicality of the attack against a widely adopted machine learning framework. Our aim with this work is to raise awareness against these new, dangerous attacks both in the research community and industry, and we hope to encourage further research in mitigation techniques against such threats.

View on arXiv PDF

Similar