CRLGFeb 11, 2025

Scalable Fingerprinting of Large Language Models

arXiv:2502.07760v29 citationsh-index: 55Has Code
Originality Highly original
AI Analysis

This addresses the need for model owners to securely identify shared models against leakage and coalitions, offering a scalable solution with significant improvements over existing methods.

The paper tackles the problem of scalable fingerprinting for large language models to enhance detection and security, introducing Perinucleus sampling to embed 24,576 fingerprints into a Llama-3.1-8B model without utility degradation, and showing persistence after fine-tuning.

Model fingerprinting has emerged as a powerful tool for model owners to identify their shared model given API access. However, to lower false discovery rate, fight fingerprint leakage, and defend against coalitions of model users attempting to bypass detection, we argue that {\em scalability} is critical, i.e., scaling up the number of fingerprints one can embed into a model. Hence, we pose scalability as a crucial requirement for fingerprinting schemes. We experiment with fingerprint design at a scale significantly larger than previously considered, and introduce a new method, dubbed Perinucleus sampling, to generate scalable, persistent, and harmless fingerprints. We demonstrate that this scheme can add 24,576 fingerprints to a Llama-3.1-8B model -- two orders of magnitude more than existing schemes -- without degrading the model's utility. Our inserted fingerprints persist even after supervised fine-tuning on standard post-training data. We further address security risks for fingerprinting, and theoretically and empirically show how a scalable fingerprinting scheme like ours can mitigate these risks. Our code is available at https://github.com/SewoongLab/scalable-fingerprinting-of-llms

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes