LGAIMay 26

HEAL: Resilient and Self-* Hub-based Learning

arXiv:2605.274757.31 citationsh-index: 2
Predicted impact top 96% in LG · last 90 daysOriginality Incremental advance
AI Analysis

For decentralized learning systems, HEAL provides a fault-tolerant alternative to Federated Learning without a single point of failure.

HEAL introduces a decentralized learning framework that combines Federated, Gossip, and Epidemic Learning via a self-organizing overlay, achieving performance comparable to Federated Learning in crash-free settings and outperforming Gossip and Epidemic Learning under crashes and churn.

Decentralized learning enhances privacy, scalability, and fault tolerance by distributing data and computation across nodes. A popular approach is Federated learning, which relies on a central aggregator, yet faces challenges such as server vulnerabilities, scalability issues, privacy risks and most importantly, the single point of failure. Alternatively Gossip Learning and Epidemic Learning offer fully decentralization through peer-to-peer exchanges of model updates, ensuring robustness and privacy, at the price of slower model convergence. In this work, we introduce a novel decentralized learning framework called HEAL. HEAL is the first cross-layer decentralized learning framework that exploits an optimized self-organizing and self-healing underlying P2P overlay combining the strengths of Federated Learning, Gossip and Epidemic Learning. Leveraging the recently proposed Elevator algorithm, HEAL promotes dynamically chosen nodes to act as aggregators. Through simulations, we demonstrate that HEAL has similar performances to that of Federated Learning in crash-free settings, while being fully decentralized and fault-tolerant. In crash and churn prone environments HEAL outperforms Gossip and Epidemic Learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes