CRAIMar 6, 2024

Do You Trust Your Model? Emerging Malware Threats in the Deep Learning Ecosystem

arXiv:2403.03593v312 citationsh-index: 13IEEE Transactions on Dependable and Secure Computing
Originality Highly original
AI Analysis

This work highlights a new supply chain threat for users of pre-trained models, raising awareness in the research and industry communities.

The paper tackles the problem of malware threats in deep learning by introducing MaleficNet 2.0, a technique that embeds self-extracting malware in neural network parameters without degrading model performance, demonstrating its effectiveness in proof-of-concept attacks.

Training high-quality deep learning models is a challenging task due to computational and technical requirements. A growing number of individuals, institutions, and companies increasingly rely on pre-trained, third-party models made available in public repositories. These models are often used directly or integrated in product pipelines with no particular precautions, since they are effectively just data in tensor form and considered safe. In this paper, we raise awareness of a new machine learning supply chain threat targeting neural networks. We introduce MaleficNet 2.0, a novel technique to embed self-extracting, self-executing malware in neural networks. MaleficNet 2.0 uses spread-spectrum channel coding combined with error correction techniques to inject malicious payloads in the parameters of deep neural networks. MaleficNet 2.0 injection technique is stealthy, does not degrade the performance of the model, and is robust against removal techniques. We design our approach to work both in traditional and distributed learning settings such as Federated Learning, and demonstrate that it is effective even when a reduced number of bits is used for the model parameters. Finally, we implement a proof-of-concept self-extracting neural network malware using MaleficNet 2.0, demonstrating the practicality of the attack against a widely adopted machine learning framework. Our aim with this work is to raise awareness against these new, dangerous attacks both in the research community and industry, and we hope to encourage further research in mitigation techniques against such threats.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes