LGAISEJun 11, 2021

ModelDiff: Testing-Based DNN Similarity Comparison for Model Reuse Detection

arXiv:2106.08890v169 citations
Originality Incremental advance
AI Analysis

This addresses a practical issue for model owners and security practitioners in verifying model originality, though it is an incremental improvement in detection methods.

The paper tackles the problem of detecting knowledge reuse in deep learning models to prevent intellectual property infringement and vulnerability propagation, achieving 91.7% correctness on a benchmark of 144 model pairs.

The knowledge of a deep learning model may be transferred to a student model, leading to intellectual property infringement or vulnerability propagation. Detecting such knowledge reuse is nontrivial because the suspect models may not be white-box accessible and/or may serve different tasks. In this paper, we propose ModelDiff, a testing-based approach to deep learning model similarity comparison. Instead of directly comparing the weights, activations, or outputs of two models, we compare their behavioral patterns on the same set of test inputs. Specifically, the behavioral pattern of a model is represented as a decision distance vector (DDV), in which each element is the distance between the model's reactions to a pair of inputs. The knowledge similarity between two models is measured with the cosine similarity between their DDVs. To evaluate ModelDiff, we created a benchmark that contains 144 pairs of models that cover most popular model reuse methods, including transfer learning, model compression, and model stealing. Our method achieved 91.7% correctness on the benchmark, which demonstrates the effectiveness of using ModelDiff for model reuse detection. A study on mobile deep learning apps has shown the feasibility of ModelDiff on real-world models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes