An Uncertainty-Aware Resilience Micro-Agent for Causal Observability in the Computing Continuum

arXiv:2605.1071856.4
Predicted impact top 36% in DC · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses the problem of reliable diagnosis and mitigation of grey failures in the computing continuum for edge-tier environments, offering a practical solution with low overhead.

AURORA introduces a lightweight framework for diagnosing and mitigating grey failures in edge-tier environments, achieving 0% destructive action rate, 62.0% repair accuracy, and 3ms mean time to repair.

Grey failures in the computing continuum produce ambiguous overlapping symptoms that existing approaches fail to diagnose reliably, either due to a lack of causal awareness or acting under high epistemic uncertainty, risking destructive interventions. This paper presents an uncertainty-aware resilience micro-agent for causal observability (AURORA), a lightweight framework for diagnosing and mitigating grey failures in edge-tier environments. The framework employs parallel micro-agents that integrate the free-energy principle, causal do-calculus, and localized causal state-graphs to support counterfactual root-cause analysis within each fault's Markov blanket. Restricting inference to causally relevant variables reduces computational overhead while preserving diagnostic fidelity. AURORA further introduces a dual-gated execution mechanism that authorizes remediation only when causal confidence is high and predicted epistemic uncertainty is bounded; otherwise, it abstains from local intervention and escalates the diagnostic payload to the fog tier. Our experiments demonstrate that AURORA outperforms baselines, achieving a 0% destructive action rate, while maintaining 62.0% repair accuracy and a 3ms mean time to repair.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes