SDCLCRLGASNov 29, 2022

Model Extraction Attack against Self-supervised Speech Models

arXiv:2211.16044v21 citationsh-index: 52
Originality Incremental advance
AI Analysis

This addresses security vulnerabilities in speech AI systems, posing a threat to model owners, but it is incremental as it adapts existing attack methods to a new domain.

The paper tackles the problem of model extraction attacks against self-supervised speech models, proposing a two-stage framework that uses active sampling to steal model functionality with a small number of queries, achieving effective extraction without knowledge of the target model's architecture.

Self-supervised learning (SSL) speech models generate meaningful representations of given clips and achieve incredible performance across various downstream tasks. Model extraction attack (MEA) often refers to an adversary stealing the functionality of the victim model with only query access. In this work, we study the MEA problem against SSL speech model with a small number of queries. We propose a two-stage framework to extract the model. In the first stage, SSL is conducted on the large-scale unlabeled corpus to pre-train a small speech model. Secondly, we actively sample a small portion of clips from the unlabeled corpus and query the target model with these clips to acquire their representations as labels for the small model's second-stage training. Experiment results show that our sampling methods can effectively extract the target model without knowing any information about its model architecture.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes