CRCLJun 1, 2022

THE-X: Privacy-Preserving Transformer Inference with Homomorphic Encryption

Microsoft
arXiv:2206.00216v2669 citationsh-index: 102
Originality Incremental advance
AI Analysis

This addresses privacy concerns for cloud service users by enabling secure inference without exposing plain-text data, though it is an incremental improvement over existing HE methods.

The paper tackles the problem of privacy-preserving inference for transformer models on encrypted data using homomorphic encryption, proposing THE-X to handle non-polynomial functions like GELU and softmax, with experiments showing negligible performance drop across downstream tasks.

As more and more pre-trained language models adopt on-cloud deployment, the privacy issues grow quickly, mainly for the exposure of plain-text user data (e.g., search history, medical record, bank account). Privacy-preserving inference of transformer models is on the demand of cloud service users. To protect privacy, it is an attractive choice to compute only with ciphertext in homomorphic encryption (HE). However, enabling pre-trained models inference on ciphertext data is difficult due to the complex computations in transformer blocks, which are not supported by current HE tools yet. In this work, we introduce $\textit{THE-X}$, an approximation approach for transformers, which enables privacy-preserving inference of pre-trained models developed by popular frameworks. $\textit{THE-X}$ proposes a workflow to deal with complex computation in transformer networks, including all the non-polynomial functions like GELU, softmax, and LayerNorm. Experiments reveal our proposed $\textit{THE-X}$ can enable transformer inference on encrypted data for different downstream tasks, all with negligible performance drop but enjoying the theory-guaranteed privacy-preserving advantage.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes