LG | SE | Jul 31, 2025

Scalable and Precise Patch Robustness Certification for Deep Learning Models with Top-k Predictions

arXiv:2507.23335v1 | h-index: 5 | QRS
Originality: Highly original
AI Analysis

This paper addresses the need for scalable and precise verification against adversarial patch attacks in deep learning systems. It represents a strong, specific gain rather than a foundational advance.

The paper tackles the problem of certifying patch robustness for deep learning models with top-k predictions, proposing CostCert, which significantly outperforms the state-of-the-art defender PatchGuard by retaining up to 57.3% certified accuracy at a patch size of 96, where PatchGuard drops to zero.

Patch robustness certification is an emerging verification approach that defends deep learning systems against adversarial patch attacks with provable guarantees. Certified recovery techniques guarantee that the sole true label of a certified sample is predicted. However, existing techniques, where applicable to top-k predictions, commonly conduct pairwise comparisons of the votes between labels. This fails to certify the sole true label within the top-k prediction labels precisely, because the number of votes controlled by the attacker (i.e., the attack budget) is inflated; enumerating all combinations of vote allocations instead suffers from combinatorial explosion. We propose CostCert, a novel, scalable, and precise voting-based certified recovery defender. CostCert verifies that the true label of a sample is within the top-k predictions without pairwise comparisons or combinatorial explosion through a novel design: it checks whether the attack budget on the sample is insufficient to cover the smallest total number of additional votes, on top of the votes the attacker cannot control, needed to exclude the true label from the top-k prediction labels. Experiments show that CostCert significantly outperforms the current state-of-the-art defender PatchGuard; for example, it retains up to 57.3% certified accuracy at a patch size of 96, whereas PatchGuard has already dropped to zero.
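The budget-versus-cost check described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation; it is a reading of the abstract under stated assumptions: each label has a count of votes the attacker cannot touch, the attacker may assign up to `attack_budget` extra votes freely, and ties are resolved against the true label (a conservative choice). The function names and the tie-breaking rule are hypothetical.

```python
from typing import Sequence


def min_cost_to_exclude(uncontrolled_votes: Sequence[int], true_label: int, k: int) -> int:
    """Smallest total number of extra votes needed so that k other labels
    reach at least the true label's vote count (assumption: ties count
    against the true label)."""
    true_votes = uncontrolled_votes[true_label]
    # Extra votes each competing label needs to catch up with the true label.
    costs = sorted(
        max(0, true_votes - votes)
        for label, votes in enumerate(uncontrolled_votes)
        if label != true_label
    )
    # The cheapest way to push the true label out of the top k is to lift
    # the k competitors that are already closest to it.
    return sum(costs[:k])


def is_certified_top_k(
    uncontrolled_votes: Sequence[int], true_label: int, k: int, attack_budget: int
) -> bool:
    """True if the budget cannot cover the minimum exclusion cost."""
    if k >= len(uncontrolled_votes):
        # Fewer than k competitors exist, so the true label cannot be excluded.
        return True
    return attack_budget < min_cost_to_exclude(uncontrolled_votes, true_label, k)


# Hypothetical vote counts for four labels; label 0 is the true label.
votes = [10, 7, 4, 1]
print(min_cost_to_exclude(votes, true_label=0, k=2))
print(is_certified_top_k(votes, true_label=0, k=2, attack_budget=8))
print(is_certified_top_k(votes, true_label=0, k=2, attack_budget=9))
```

In this toy case a pairwise check that grants the full budget of 8 to each competitor separately would see labels 1 and 2 both overtaking label 0 and would refuse to certify, even though 8 votes cannot lift both at once. The check needs only one sort over the labels and no enumeration of vote allocations.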

