CVMay 24, 2022

Privacy-Preserving Image Classification Using Vision Transformer

arXiv:2205.12041v120 citationsh-index: 35
Originality Incremental advance
AI Analysis

This addresses privacy concerns in image classification for applications like healthcare or surveillance, though it appears incremental as it adapts existing ViT methods to encrypted data.

The paper tackles privacy-preserving image classification by combining encrypted images with vision transformers (ViT), achieving high classification accuracy and outperforming state-of-the-art methods in accuracy and robustness against attacks.

In this paper, we propose a privacy-preserving image classification method that is based on the combined use of encrypted images and the vision transformer (ViT). The proposed method allows us not only to apply images without visual information to ViT models for both training and testing but to also maintain a high classification accuracy. ViT utilizes patch embedding and position embedding for image patches, so this architecture is shown to reduce the influence of block-wise image transformation. In an experiment, the proposed method for privacy-preserving image classification is demonstrated to outperform state-of-the-art methods in terms of classification accuracy and robustness against various attacks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes