LG Jul 8, 2024
An open source Multi-Agent Deep Reinforcement Learning Routing Simulator for satellite networks
Federico Lozano-Cuadra, Mathias D. Thorsager, Israel Leyva-Mayorga et al.
This paper introduces an open source simulator for packet routing in Low Earth Orbit Satellite Constellations (LSatCs) that accounts for dynamic system uncertainties. The simulator, implemented in Python, supports traditional Dijkstra-based routing as well as more advanced learning solutions, specifically Q-Routing and Multi-Agent Deep Reinforcement Learning (MA-DRL) from our previous work. It uses an event-based approach with the SimPy module to accurately simulate packet creation, routing and queuing, providing real-time tracking of queues and latency. The simulator is highly configurable, allowing adjustments in routing policies, traffic, ground and space layer topologies, communication parameters, and learning hyperparameters. Key features include the ability to visualize system motion and track packet paths. Results highlight significant improvements in end-to-end (E2E) latency using Reinforcement Learning (RL)-based routing policies compared to traditional methods. The source code, the documentation and a Jupyter notebook with post-processing results and analysis are available on GitHub.
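As a rough illustration of the Dijkstra-based baseline mentioned in this abstract, the sketch below computes lowest-latency paths on a small latency-weighted graph. It is not taken from the simulator: the function name `dijkstra`, the node names, and the latency values are all hypothetical.

```python
import heapq


def dijkstra(graph, source):
    """Return the lowest total latency from source to every reachable node."""
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, node = heapq.heappop(heap)
        if d > dist.get(node, float("inf")):
            continue  # stale heap entry, a shorter path was already found
        for neighbour, latency in graph.get(node, {}).items():
            candidate = d + latency
            if candidate < dist.get(neighbour, float("inf")):
                dist[neighbour] = candidate
                heapq.heappush(heap, (candidate, neighbour))
    return dist


# Hypothetical four-node topology; edge weights are link latencies in ms.
links = {
    "gateway_a": {"sat_1": 5.0, "sat_2": 9.0},
    "sat_1": {"sat_2": 2.0, "gateway_b": 12.0},
    "sat_2": {"gateway_b": 4.0},
    "gateway_b": {},
}
latency = dijkstra(links, "gateway_a")
```

A shortest-path baseline of this kind needs the full, up-to-date link weights at the node that runs it, which is the kind of global knowledge the learning-based policies aim to avoid.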
NI May 21
Impact of Atmospheric Turbulence and Pointing Error on Earth Observation
Celia Sánchez-de-Miguel, Antonio M. Mercado-Martínez, Beatriz Soret et al.
Earth Observation (EO) imagery is often degraded by atmospheric turbulence and pointing jitter; yet, these effects are rarely considered in datasets used to train AI-based detection models. Based on prior work, this paper presents an enhanced image simulator that enables the incorporation of vertical-path atmospheric turbulence and satellite pointing jitter, arising from platform and sensor vibrations, to generate physically realistic distorted images. As a case study, vessel detection is evaluated using YOLOv8 and RetinaNet on images generated by the proposed simulator under different levels of turbulence and pointing errors. Results show that YOLOv8 recall decreases from 91% under ideal conditions to 60% in the presence of weak turbulence, and falls below 40% under strong turbulence or jitter. In contrast, RetinaNet demonstrates greater robustness, maintaining approximately 75% recall across degraded conditions. These results highlight the importance of incorporating realistic physical degradations into EO training datasets to ensure reliable performance of AI-based models in operational environments, as demonstrated in maritime surveillance applications.
NI May 11
Statistical Analysis for Energy-Efficient Satellite Edge Computing with Latency Guarantees
Nicolai Dalsgaard Lyholm, Beatriz Soret, Tijana Devaja et al.
Being able to provide latency guarantees for orbital edge computing applications through Low Earth Orbit (LEO) satellite constellations is a major milestone for their integration into 5G and 6G networks. However, achieving this is fundamentally challenged by the inherent randomness in both communication and computing latency, driven by complex network dynamics, satellite motion, and hardware variability. In this paper, we perform a statistical analysis of the latency of satellite edge computing using representative computing hardware and an object detection algorithm running on a satellite image dataset. The resulting model captures the trade-off between data availability and estimation uncertainty, enabling data-driven optimization methods to meet latency targets with statistical guarantees while minimizing energy consumption. Our results show that parametric estimation and quantile regression for the execution time of the image processing algorithms can be effectively combined with models for the communication latency to select an optimal GPU clock frequency. This achieves a 95% probability of meeting a 500 ms end-to-end deadline while reducing energy consumption by more than 50% compared to a baseline that relies on a Chebyshev-Cantelli inequality to bound execution-time quantiles. The proposed framework is generalizable across satellite edge computing workloads and hardware platforms.
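To give a feel for why a Chebyshev-Cantelli baseline tends to be conservative, the sketch below compares the distribution-free Cantelli upper bound on a quantile, mean + std · sqrt(p / (1 − p)), with the empirical quantile of the same samples. This is only an illustration under stated assumptions: the execution times in `samples_ms` are made up, and the function names are hypothetical rather than the authors' code.

```python
import math
import statistics


def cantelli_quantile_bound(samples, p):
    """Upper bound on the p-quantile using only mean and std (Cantelli)."""
    mu = statistics.fmean(samples)
    sigma = statistics.pstdev(samples)
    # P(X - mu >= k * sigma) <= 1 / (1 + k^2); set the right side to 1 - p.
    k = math.sqrt(p / (1.0 - p))
    return mu + k * sigma


def empirical_quantile(samples, p):
    """Smallest sample value whose empirical CDF is at least p."""
    ordered = sorted(samples)
    index = math.ceil(p * len(ordered)) - 1
    return ordered[max(index, 0)]


# Made-up execution times in ms with a heavy right tail (illustrative only).
samples_ms = [
    118.0, 119.0, 120.0, 121.0, 121.0, 122.0, 122.0, 123.0, 124.0, 124.0,
    125.0, 125.0, 126.0, 127.0, 128.0, 130.0, 133.0, 138.0, 147.0, 160.0,
]
bound_ms = cantelli_quantile_bound(samples_ms, 0.95)
observed_ms = empirical_quantile(samples_ms, 0.95)
```

Because the Cantelli bound must hold for any distribution with the given mean and variance, it sits above the empirical quantile; a larger execution-time budget of this kind leaves less slack for lowering the clock frequency.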
LG May 20, 2024
Continual Deep Reinforcement Learning for Decentralized Satellite Routing
Federico Lozano-Cuadra, Beatriz Soret, Israel Leyva-Mayorga et al.
This paper introduces a full solution for decentralized routing in Low Earth Orbit satellite constellations based on continual Deep Reinforcement Learning (DRL). This requires addressing multiple challenges, including the partial knowledge at the satellites and their continuous movement, and the time-varying sources of uncertainty in the system, such as traffic, communication links, or communication buffers. We follow a multi-agent approach, where each satellite acts as an independent decision-making agent, while acquiring limited knowledge of the environment based on the feedback received from the nearby agents. The solution is divided into two phases. First, an offline learning phase relies on decentralized decisions and a global Deep Neural Network (DNN) trained with global experiences. Then, the online phase with local, on-board, and pre-trained DNNs requires continual learning to evolve with the environment, which can be done in two different ways: (1) Model anticipation, where the predictable conditions of the constellation are exploited by each satellite sharing its local model with the next satellite; and (2) Federated Learning (FL), where each agent's model is merged first at the cluster level and then aggregated in a global Parameter Server. The results show that, without high congestion, the proposed Multi-Agent DRL framework achieves the same E2E performance as a shortest-path solution, but the latter assumes intensive communication overhead for real-time network-wide knowledge of the system at a centralized node, whereas ours only requires limited feedback exchange among first-neighbour satellites. Importantly, our solution adapts well to congestion conditions and exploits less loaded paths. Moreover, the divergence of models over time is easily tackled by the synergy between anticipation, applied in short-term alignment, and FL, utilized for long-term alignment.
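The two-level Federated Learning step described in this abstract (merge within each cluster, then aggregate at a global Parameter Server) can be sketched as below. The abstract does not state the aggregation rule, so the sample-count weighting (FedAvg-style) is an assumption, and the function names and parameter vectors are hypothetical.

```python
def weighted_average(models, weights):
    """Element-wise weighted mean of equally sized parameter vectors."""
    total = sum(weights)
    size = len(models[0])
    return [
        sum(w * m[i] for m, w in zip(models, weights)) / total
        for i in range(size)
    ]


def hierarchical_fedavg(clusters):
    """clusters: list of clusters, each a list of (parameters, num_samples)."""
    cluster_models, cluster_sizes = [], []
    for members in clusters:
        vectors = [params for params, _ in members]
        counts = [count for _, count in members]
        # Level 1: merge the agents' models inside one cluster.
        cluster_models.append(weighted_average(vectors, counts))
        cluster_sizes.append(sum(counts))
    # Level 2: aggregate the cluster models at the global parameter server.
    return weighted_average(cluster_models, cluster_sizes)


# Hypothetical two-parameter models from three satellites in two clusters.
clusters = [
    [([1.0, 2.0], 1), ([3.0, 4.0], 1)],
    [([5.0, 6.0], 2)],
]
global_model = hierarchical_fedavg(clusters)
```

With sample-count weights, the two-level result equals a single flat weighted average over all agents; the hierarchy changes where the communication happens, not the aggregate.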
IT Dec 8, 2023
Generative Network Layer for Communication Systems with Artificial Intelligence
Mathias Thorsager, Israel Leyva-Mayorga, Beatriz Soret et al.
The traditional role of the network layer is the transfer of packet replicas from source to destination through intermediate network nodes. We present a generative network layer that uses Generative AI (GenAI) at intermediate or edge network nodes and analyze its impact on the required data rates in the network. We conduct a case study where the GenAI-aided nodes generate images from prompts that consist of substantially compressed latent representations. The results from network flow analyses under image quality constraints show that the generative network layer can achieve an improvement of more than 100% in terms of the required data rate.
LG Feb 27, 2024
Multi-Agent Deep Reinforcement Learning for Distributed Satellite Routing
Federico Lozano-Cuadra, Beatriz Soret
This paper introduces a Multi-Agent Deep Reinforcement Learning (MA-DRL) approach for routing in Low Earth Orbit Satellite Constellations (LSatCs). Each satellite is an independent decision-making agent with partial knowledge of the environment, supported by feedback received from the nearby agents. Building on our previous work that introduced a Q-routing solution, the contribution of this paper is to extend it to a deep learning framework that can quickly adapt to network and traffic changes and is based on two phases: (1) an offline exploration learning phase that relies on a global Deep Neural Network (DNN) to learn the optimal paths at each possible position and congestion level; (2) an online exploitation phase with local, on-board, pre-trained DNNs. Results show that MA-DRL efficiently learns optimal routes offline that are then loaded for an efficient distributed routing online.
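For readers unfamiliar with the Q-routing idea that this work builds on, here is a minimal tabular sketch of the classic Q-routing update: a node refines its estimated delivery time through a neighbour using that neighbour's own best estimate as feedback. It shows the textbook rule, not the authors' implementation; the table layout `q_table[node][dest][next_hop]`, the node names, the delays, and the learning rate are all assumptions.

```python
def q_routing_update(q_table, node, dest, neighbour,
                     queue_delay, link_delay, alpha=0.5):
    """Move Q[node][dest][neighbour] toward the delay reported via neighbour."""
    if neighbour == dest:
        remaining = 0.0  # the packet has arrived, nothing left to estimate
    else:
        # Feedback from the neighbour: its best estimate of the remaining time.
        remaining = min(q_table[neighbour][dest].values())
    target = queue_delay + link_delay + remaining
    old = q_table[node][dest][neighbour]
    q_table[node][dest][neighbour] = old + alpha * (target - old)
    return q_table[node][dest][neighbour]


# Hypothetical estimates (ms) for delivering to ground gateway "gw".
q = {
    "s1": {"gw": {"s2": 10.0, "s3": 10.0}},
    "s2": {"gw": {"gw": 4.0}},
    "s3": {"gw": {"gw": 8.0}},
}
updated = q_routing_update(q, "s1", "gw", "s2", queue_delay=1.0, link_delay=2.0)
best_next_hop = min(q["s1"]["gw"], key=q["s1"]["gw"].get)
```

Only one-hop feedback is exchanged per update, which matches the abstract's emphasis on agents that rely on information from nearby satellites rather than global state.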
NI Apr 7
Edge Intelligence for Satellite-based Earth Observation: Scheduling Image Acquisition and Processing
Beatriz Soret, Antonio M. Mercado-Martínez, Antonio Jurado-Navas et al.
Modern Earth Observation (EO) missions generate massive volumes of imagery that challenge existing downlink and ground-processing capabilities, particularly for time-critical applications. This work investigates how a low Earth orbit (LEO) satellite constellation equipped with heterogeneous edge computing resources can enable real-time semantic processing of data acquired by EO satellites. We introduce an energy-aware framework that optimizes the use of resources accounting for data acquisition, computing, and communication constraints. Although we focus on maritime surveillance, the formulation is task-agnostic and accommodates a broad class of semantic and goal-oriented inference problems. Specifically, we formulate two coupled optimization problems: (i) observation scheduling, which selects image acquisition opportunities while accounting for turbulence-induced image degradation and energy budget, and (ii) processing scheduling, which allocates semantic workloads across onboard and ground processors. We evaluate these mechanisms for the task of detection and localization of vessels, for which we quantify the benefits of turbulence-aware observation scheduling for preserving image quality and experimentally characterize the execution-time distribution of YOLOv8 on different computing platforms. Results demonstrate that task- and turbulence-aware observation scheduling can significantly improve the quality and quantity of observed targets. Furthermore, cooperative edge processing within the constellation substantially reduces power consumption compared to traditional downlink-centric architectures. These findings highlight the potential of distributed edge intelligence to enhance the responsiveness and autonomy of future satellite-based EO systems.
LG Nov 20, 2025
Real-Time Inference for Distributed Multimodal Systems under Communication Delay Uncertainty
Victor Croisfelt, João Henrique Inacio de Souza, Shashi Raj Pandey et al.
Connected cyber-physical systems perform inference based on real-time inputs from multiple data streams. Uncertain communication delays across data streams challenge the temporal flow of the inference process. State-of-the-art (SotA) non-blocking inference methods rely on a reference-modality paradigm, requiring one modality input to be fully received before processing, while depending on costly offline profiling. We propose a novel, neuro-inspired non-blocking inference paradigm that primarily employs adaptive temporal windows of integration (TWIs) to dynamically adjust to stochastic delay patterns across heterogeneous streams while relaxing the reference-modality requirement. Our communication-delay-aware framework achieves robust real-time inference with finer-grained control over the accuracy-latency tradeoff. Experiments on the audio-visual event localization (AVEL) task demonstrate superior adaptability to network dynamics compared to SotA approaches.
ML Oct 23, 2025
Learning Decentralized Routing Policies via Graph Attention-based Multi-Agent Reinforcement Learning in Lunar Delay-Tolerant Networks
Federico Lozano-Cuadra, Beatriz Soret, Marc Sanchez Net et al.
We present a fully decentralized routing framework for multi-robot exploration missions operating under the constraints of a Lunar Delay-Tolerant Network (LDTN). In this setting, autonomous rovers must relay collected data to a lander under intermittent connectivity and unknown mobility patterns. We formulate the problem as a Partially Observable Markov Decision Problem (POMDP) and propose a Graph Attention-based Multi-Agent Reinforcement Learning (GAT-MARL) policy that performs Centralized Training, Decentralized Execution (CTDE). Our method relies only on local observations and does not require global topology updates or packet replication, unlike classical approaches such as shortest path and controlled flooding-based algorithms. Through Monte Carlo simulations in randomized exploration environments, GAT-MARL provides higher delivery rates, no duplications, and fewer packet losses, and is able to leverage short-term mobility forecasts, offering a scalable solution for future space robotic systems for planetary exploration, as demonstrated by successful generalization to larger rover teams.
RO Mar 3, 2025
An energy-efficient learning solution for the Agile Earth Observation Satellite Scheduling Problem
Antonio M. Mercado-Martínez, Beatriz Soret, Antonio Jurado-Navas
The Agile Earth Observation Satellite Scheduling Problem (AEOSSP) entails finding the subset of observation targets to be scheduled along the satellite's orbit while meeting operational constraints of time, energy and memory. The problem of deciding what and when to observe is inherently complex, and becomes even more challenging when considering several issues that compromise the quality of the captured images, such as cloud occlusion, atmospheric turbulence, and image resolution. This paper presents a Deep Reinforcement Learning (DRL) approach for addressing the AEOSSP with time-dependent profits, integrating these three factors to optimize the use of energy and memory resources. The proposed method involves a dual decision-making process: selecting the sequence of targets and determining the optimal observation time for each. Our results demonstrate that the proposed algorithm reduces the capture of images that fail to meet quality requirements by more than 60% and consequently decreases energy waste from attitude maneuvers by up to 78%, all while maintaining strong observation performance.
DC Oct 4, 2021
Learning, Computing, and Trustworthiness in Intelligent IoT Environments: Performance-Energy Tradeoffs
Beatriz Soret, Lam D. Nguyen, Jan Seeger et al.
An Intelligent IoT Environment (iIoTe) comprises heterogeneous devices that can collaboratively execute semi-autonomous IoT applications, examples of which include highly automated manufacturing cells or autonomously interacting harvesting machines. Energy efficiency is key in such edge environments, since they are often based on an infrastructure that consists of wireless and battery-run devices, e.g., e-tractors, drones, Automated Guided Vehicles (AGVs) and robots. The total energy consumption draws contributions from multiple iIoTe technologies that enable edge computing and communication, distributed learning, as well as distributed ledgers and smart contracts. This paper provides a state-of-the-art overview of these technologies and illustrates their functionality and performance, with special attention to the tradeoff among resources, latency, privacy and energy consumption. Finally, the paper provides a vision for integrating these enabling technologies in energy-efficient iIoTe and a roadmap to address the open research challenges.