LGCRSEAug 11, 2022

Safety and Performance, Why not Both? Bi-Objective Optimized Model Compression toward AI Software Deployment

arXiv:2208.05969v215 citationsh-index: 15
Originality Incremental advance
AI Analysis

This addresses the challenge of safely deploying compressed AI models on devices like smartphones, which is incremental as it builds on existing compression methods by integrating safety considerations.

The paper tackles the problem of compressing deep learning models for deployment on resource-limited devices while maintaining safety against attacks, proposing a test-driven sparse training framework called SafeCompress that achieves effective and generalized compression across multiple datasets and tasks.

The size of deep learning models in artificial intelligence (AI) software is increasing rapidly, which hinders the large-scale deployment on resource-restricted devices (e.g., smartphones). To mitigate this issue, AI software compression plays a crucial role, which aims to compress model size while keeping high performance. However, the intrinsic defects in the big model may be inherited by the compressed one. Such defects may be easily leveraged by attackers, since the compressed models are usually deployed in a large number of devices without adequate protection. In this paper, we try to address the safe model compression problem from a safety-performance co-optimization perspective. Specifically, inspired by the test-driven development (TDD) paradigm in software engineering, we propose a test-driven sparse training framework called SafeCompress. By simulating the attack mechanism as the safety test, SafeCompress can automatically compress a big model to a small one following the dynamic sparse training paradigm. Further, considering a representative attack, i.e., membership inference attack (MIA), we develop a concrete safe model compression mechanism, called MIA-SafeCompress. Extensive experiments are conducted to evaluate MIA-SafeCompress on five datasets for both computer vision and natural language processing tasks. The results verify the effectiveness and generalization of our method. We also discuss how to adapt SafeCompress to other attacks besides MIA, demonstrating the flexibility of SafeCompress.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes