LGAISep 16, 2021

Reinforcement Learning on Encrypted Data

arXiv:2109.08236v1
Originality Synthesis-oriented
AI Analysis

This addresses privacy concerns for RL applications in sensitive domains like secure sites, but it is incremental as it builds on existing MDP and encryption methods.

The paper tackled the problem of training reinforcement learning agents on encrypted data to protect sensitive inputs, developing an extension to the MDP framework for state encryption. Results showed that a DQN agent could learn in small state spaces with non-deterministic encryption, but performance collapsed in more complex environments.

The growing number of applications of Reinforcement Learning (RL) in real-world domains has led to the development of privacy-preserving techniques due to the inherently sensitive nature of data. Most existing works focus on differential privacy, in which information is revealed in the clear to an agent whose learned model should be robust against information leakage to malicious third parties. Motivated by use cases in which only encrypted data might be shared, such as information from sensitive sites, in this work we consider scenarios in which the inputs themselves are sensitive and cannot be revealed. We develop a simple extension to the MDP framework which provides for the encryption of states. We present a preliminary, experimental study of how a DQN agent trained on encrypted states performs in environments with discrete and continuous state spaces. Our results highlight that the agent is still capable of learning in small state spaces even in presence of non-deterministic encryption, but performance collapses in more complex environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes