LGCRCVDCMay 11, 2022

Secure & Private Federated Neuroimaging

arXiv:2205.05249v23 citationsh-index: 40
Originality Incremental advance
AI Analysis

This addresses privacy and regulatory concerns for biomedical researchers and institutions, but it is incremental as it applies existing Federated Learning concepts with enhanced security to a specific domain.

The paper tackled the challenge of joint analysis of biomedical data across multiple sites by using Federated Learning to enable distributed training without sharing data, achieving strong security and privacy through encryption and information-theoretic methods in neuroimaging tasks like Alzheimer's disease prediction and BrainAGE estimation.

The amount of biomedical data continues to grow rapidly. However, collecting data from multiple sites for joint analysis remains challenging due to security, privacy, and regulatory concerns. To overcome this challenge, we use Federated Learning, which enables distributed training of neural network models over multiple data sources without sharing data. Each site trains the neural network over its private data for some time, then shares the neural network parameters (i.e., weights, gradients) with a Federation Controller, which in turn aggregates the local models, sends the resulting community model back to each site, and the process repeats. Our Federated Learning architecture, MetisFL, provides strong security and privacy. First, sample data never leaves a site. Second, neural network parameters are encrypted before transmission and the global neural model is computed under fully-homomorphic encryption. Finally, we use information-theoretic methods to limit information leakage from the neural model to prevent a curious site from performing model inversion or membership attacks. We present a thorough evaluation of the performance of secure, private federated learning in neuroimaging tasks, including for predicting Alzheimer's disease and estimating BrainAGE from magnetic resonance imaging (MRI) studies, in challenging, heterogeneous federated environments where sites have different amounts of data and statistical distributions.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes