CRAIDCLGAug 30, 2025

Federated Survival Analysis with Node-Level Differential Privacy: Private Kaplan-Meier Curves

arXiv:2509.00615v1h-index: 6
Originality Incremental advance
AI Analysis

This work addresses privacy concerns in federated survival analysis for healthcare applications, offering a practical solution without iterative training or heavy cryptography.

The paper tackled the problem of calculating Kaplan-Meier survival curves across multiple healthcare jurisdictions while protecting patient privacy using node-level differential privacy, achieving empirical log-rank type-I error below 15% for privacy budgets of 0.5 and higher.

We investigate how to calculate Kaplan-Meier survival curves across multiple health-care jurisdictions while protecting patient privacy with node-level differential privacy. Each site discloses its curve only once, adding Laplace noise whose scale is determined by the length of the common time grid; the server then averages the noisy curves, so the overall privacy budget remains unchanged. We benchmark four one-shot smoothing techniques: Discrete Cosine Transform, Haar Wavelet shrinkage, adaptive Total-Variation denoising, and a parametric Weibull fit on the NCCTG lung-cancer cohort under five privacy levels and three partition scenarios (uniform, moderately skewed, highly imbalanced). Total-Variation gives the best mean accuracy, whereas the frequency-domain smoothers offer stronger worst-case robustness and the Weibull model shows the most stable behaviour at the strictest privacy setting. Across all methods the released curves keep the empirical log-rank type-I error below fifteen percent for privacy budgets of 0.5 and higher, demonstrating that clinically useful survival information can be shared without iterative training or heavy cryptography.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes