CVLGApr 16, 2017

Google's Cloud Vision API Is Not Robust To Noise

arXiv:1704.05051v2132 citations
Originality Incremental advance
AI Analysis

This work highlights a security vulnerability in a widely used commercial API, which could allow adversaries to bypass image filtering systems, though it is incremental as it focuses on a specific attack method.

The paper demonstrates that Google's Cloud Vision API is vulnerable to adversarial noise, showing that adding an average of 14.25% impulse noise can deceive the API into generating different outputs while humans perceive the original content, and suggests that noise filtering can mitigate this issue without algorithm updates.

Google has recently introduced the Cloud Vision API for image analysis. According to the demonstration website, the API "quickly classifies images into thousands of categories, detects individual objects and faces within images, and finds and reads printed words contained within images." It can be also used to "detect different types of inappropriate content from adult to violent content." In this paper, we evaluate the robustness of Google Cloud Vision API to input perturbation. In particular, we show that by adding sufficient noise to the image, the API generates completely different outputs for the noisy image, while a human observer would perceive its original content. We show that the attack is consistently successful, by performing extensive experiments on different image types, including natural images, images containing faces and images with texts. For instance, using images from ImageNet dataset, we found that adding an average of 14.25% impulse noise is enough to deceive the API. Our findings indicate the vulnerability of the API in adversarial environments. For example, an adversary can bypass an image filtering system by adding noise to inappropriate images. We then show that when a noise filter is applied on input images, the API generates mostly the same outputs for restored images as for original images. This observation suggests that cloud vision API can readily benefit from noise filtering, without the need for updating image analysis algorithms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes