ML LG STJan 23, 2023

Federated Sufficient Dimension Reduction Through High-Dimensional Sparse Sliced Inverse Regression

Wenquan Cui, Yue Zhao, Jianjun Xu, Haoyang Cheng

arXiv:2301.09500v12 citationsh-index: 42

Originality Incremental advance

AI Analysis

This work addresses the challenge of efficient and privacy-preserving dimension reduction in distributed data settings, representing an incremental advancement by adapting existing sliced inverse regression methods to a federated learning framework.

The paper tackles the problem of federated sufficient dimension reduction with high-dimensional sparse data by proposing a federated sparse sliced inverse regression algorithm, which estimates the central dimension reduction subspace and performs variable selection while maintaining data decentralization, achieving statistical error bounds validated through simulations and real-world applications.

Federated learning has become a popular tool in the big data era nowadays. It trains a centralized model based on data from different clients while keeping data decentralized. In this paper, we propose a federated sparse sliced inverse regression algorithm for the first time. Our method can simultaneously estimate the central dimension reduction subspace and perform variable selection in a federated setting. We transform this federated high-dimensional sparse sliced inverse regression problem into a convex optimization problem by constructing the covariance matrix safely and losslessly. We then use a linearized alternating direction method of multipliers algorithm to estimate the central subspace. We also give approaches of Bayesian information criterion and hold-out validation to ascertain the dimension of the central subspace and the hyper-parameter of the algorithm. We establish an upper bound of the statistical error rate of our estimator under the heterogeneous setting. We demonstrate the effectiveness of our method through simulations and real world applications.

View on arXiv PDF

Similar