CRLGSep 18, 2024

Hard-Label Cryptanalytic Extraction of Neural Network Models

arXiv:2409.11646v17 citationsh-index: 6
Originality Highly original
AI Analysis

This addresses a critical security vulnerability for neural network models in scenarios where only hard labels are available, representing a significant advance over prior attacks limited to raw output access.

The paper tackles the problem of extracting neural network parameters under a hard-label setting where raw outputs are inaccessible, proposing the first attack that theoretically achieves functionally equivalent extraction for ReLU networks, validated by experiments on datasets like MNIST and CIFAR10, requiring only several hours for a network with 100,000 parameters.

The machine learning problem of extracting neural network parameters has been proposed for nearly three decades. Functionally equivalent extraction is a crucial goal for research on this problem. When the adversary has access to the raw output of neural networks, various attacks, including those presented at CRYPTO 2020 and EUROCRYPT 2024, have successfully achieved this goal. However, this goal is not achieved when neural networks operate under a hard-label setting where the raw output is inaccessible. In this paper, we propose the first attack that theoretically achieves functionally equivalent extraction under the hard-label setting, which applies to ReLU neural networks. The effectiveness of our attack is validated through practical experiments on a wide range of ReLU neural networks, including neural networks trained on two real benchmarking datasets (MNIST, CIFAR10) widely used in computer vision. For a neural network consisting of $10^5$ parameters, our attack only requires several hours on a single core.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes