ROMASYSYMay 30

Proactive-reactive detection and mitigation of intermittent faults in robot swarms

arXiv:2509.192467.8h-index: 13
Predicted impact top 99% in RO · last 90 daysOriginality Incremental advance
AI Analysis

For robot swarms with persistent network structures (e.g., using SoNS), this work provides a novel method to handle intermittent faults, a previously underexplored problem.

This paper addresses the challenge of detecting and mitigating intermittent faults in robot swarms, which are transient errors that disrupt coordination. The proposed proactive-reactive strategy, using self-organized backup layers and distributed consensus in a multiplex network, achieves high fault detection accuracy and low false positives, preventing intermittent faults from disrupting formation control.

Intermittent faults are transient errors that sporadically appear and disappear. Although intermittent faults pose substantial challenges to reliability and coordination, existing studies of fault tolerance in robot swarms focus instead on permanent faults. One reason for this is that intermittent faults are prohibitively difficult to detect in the fully self-organized ad-hoc networks typical of robot swarms, as their network topologies are transient and often unpredictable. However, in the recently introduced self-organizing nervous systems (SoNS) approach, robot swarms are able to self-organize persistent network structures for the first time, easing the problem of detecting intermittent faults. To address intermittent faults in robot swarms that have persistent networks, we propose a novel proactive-reactive strategy to detection and mitigation, based on self-organized backup layers and distributed consensus in a multiplex network. Proactively, the robots self-organize dynamic backup paths before faults occur, adapting to changes in the primary network topology and the robots' relative positions. Reactively, robots use one-shot likelihood ratio tests to compare information received along different paths in the multiplex network, enabling early fault detection. Upon detection, communication is temporarily rerouted in a self-organized way, until the detected fault resolves. We validate the approach in representative scenarios of faulty positional data occurring during formation control, demonstrating that intermittent faults are prevented from disrupting convergence to desired formations, with high fault detection accuracy and low rates of false positives.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes