LGCRSep 23, 2021

Secure PAC Bayesian Regression via Real Shamir Secret Sharing

arXiv:2109.11200v31 citations
Originality Incremental advance
AI Analysis

This work addresses data privacy concerns in machine learning for scenarios where multiple parties hold sensitive data, though it is incremental as it builds on existing PAC Bayesian bounds and secret sharing techniques.

The paper tackles the problem of learning linear models while preserving data privacy across multiple parties, by introducing secure protocols based on real number secret sharing and multiparty computation to solve linear regression without exposing private data, with methods including secure inverse and Gaussian elimination that simplify protocols but may leak minimal information.

A common approach of system identification and machine learning is to generate a model by using training data to predict the test data instances as accurate as possible. Nonetheless, concerns about data privacy are increasingly raised, but not always addressed. We present a secure protocol for learning a linear model relying on recently described technique called real number secret sharing. We take as our starting point the PAC Bayesian bounds and deduce a closed form for the model parameters which depends on the data and the prior from the PAC Bayesian bounds. To obtain the model parameters one needs to solve a linear system. However, we consider the situation where several parties hold different data instances and they are not willing to give up the privacy of the data. Hence, we suggest to use real number secret sharing and multiparty computation to share the data and solve the linear regression in a secure way without violating the privacy of data. We suggest two methods; a secure inverse method and a secure Gaussian elimination method, and compare these methods at the end. The benefit of using secret sharing directly on real numbers is reflected in the simplicity of the protocols and the number of rounds needed. However, this comes with the drawback that a share might leak a small amount of information, but in our analysis we argue that the leakage is small.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes