CVJun 5, 2019

Multi-way Encoding for Robustness

Donghyun Kim, Sarah Adel Bargal, Jianming Zhang, Stan Sclaroff

arXiv:1906.02033v20.9

Originality Incremental advance

AI Analysis

This work addresses security issues in computer vision for practitioners and researchers, offering an incremental improvement by modifying output encoding to reduce adversarial vulnerability.

The paper tackles the vulnerability of deep models to adversarial examples by proposing multi-way encoding as an alternative to one-hot encoding, which decorrelates source and target models to enhance security. It demonstrates improved robustness against black-box and white-box attacks on datasets like MNIST and CIFAR-10, with applications in model watermarking.

Deep models are state-of-the-art for many computer vision tasks including image classification and object detection. However, it has been shown that deep models are vulnerable to adversarial examples. We highlight how one-hot encoding directly contributes to this vulnerability and propose breaking away from this widely-used, but highly-vulnerable mapping. We demonstrate that by leveraging a different output encoding, multi-way encoding, we decorrelate source and target models, making target models more secure. Our approach makes it more difficult for adversaries to find useful gradients for generating adversarial attacks. We present robustness for black-box and white-box attacks on four benchmark datasets: MNIST, CIFAR-10, CIFAR-100, and SVHN. The strength of our approach is also presented in the form of an attack for model watermarking, raising challenges in detecting stolen models.

View on arXiv PDF

Similar