LGCRDec 17, 2025

Bits for Privacy: Evaluating Post-Training Quantization via Membership Inference

arXiv:2512.15335v11 citationsh-index: 39TrustCom
Originality Incremental advance
AI Analysis

This work addresses privacy protection for practitioners deploying quantized models, but it is incremental as it systematically studies existing quantization methods rather than introducing new ones.

The paper tackles the problem of privacy leakage in deep neural networks by evaluating how post-training quantization affects membership inference attacks, finding that low-precision quantization can reduce vulnerability by up to an order of magnitude compared to full-precision models, though with decreased utility.

Deep neural networks are widely deployed with quantization techniques to reduce memory and computational costs by lowering the numerical precision of their parameters. While quantization alters model parameters and their outputs, existing privacy analyses primarily focus on full-precision models, leaving a gap in understanding how bit-width reduction can affect privacy leakage. We present the first systematic study of the privacy-utility relationship in post-training quantization (PTQ), a versatile family of methods that can be applied to pretrained models without further training. Using membership inference attacks as our evaluation framework, we analyze three popular PTQ algorithms-AdaRound, BRECQ, and OBC-across multiple precision levels (4-bit, 2-bit, and 1.58-bit) on CIFAR-10, CIFAR-100, and TinyImageNet datasets. Our findings consistently show that low-precision PTQs can reduce privacy leakage. In particular, lower-precision models demonstrate up to an order of magnitude reduction in membership inference vulnerability compared to their full-precision counterparts, albeit at the cost of decreased utility. Additional ablation studies on the 1.58-bit quantization level show that quantizing only the last layer at higher precision enables fine-grained control over the privacy-utility trade-off. These results offer actionable insights for practitioners to balance efficiency, utility, and privacy protection in real-world deployments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes