LGDCJun 18, 2024

Training Diffusion Models with Federated Learning

arXiv:2406.12575v119 citations
Originality Incremental advance
AI Analysis

This addresses privacy and data authority concerns for users and organizations by enabling decentralized training of diffusion models, though it is incremental as it adapts existing methods.

The paper tackled the problem of centralized control in diffusion model training by proposing a federated learning scheme that enables collaborative training without exposing local data, achieving a 74% reduction in parameters exchanged while maintaining comparable image quality to centralized training.

The training of diffusion-based models for image generation is predominantly controlled by a select few Big Tech companies, raising concerns about privacy, copyright, and data authority due to their lack of transparency regarding training data. To ad-dress this issue, we propose a federated diffusion model scheme that enables the independent and collaborative training of diffusion models without exposing local data. Our approach adapts the Federated Averaging (FedAvg) algorithm to train a Denoising Diffusion Model (DDPM). Through a novel utilization of the underlying UNet backbone, we achieve a significant reduction of up to 74% in the number of parameters exchanged during training,compared to the naive FedAvg approach, whilst simultaneously maintaining image quality comparable to the centralized setting, as evaluated by the FID score.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes