Cristian Tatino

NI
h-index11
4papers
13citations
Novelty49%
AI Score31

4 Papers

LGJul 9, 2025
Generalization in Reinforcement Learning for Radio Access Networks

Burak Demirel, Yu Wang, Cristian Tatino et al.

Modern RAN operate in highly dynamic and heterogeneous environments, where hand-tuned, rule-based RRM algorithms often underperform. While RL can surpass such heuristics in constrained settings, the diversity of deployments and unpredictable radio conditions introduce major generalization challenges. Data-driven policies frequently overfit to training conditions, degrading performance in unseen scenarios. To address this, we propose a generalization-centered RL framework for RAN control that: (i) robustly reconstructs dynamically varying states from partial and noisy observations, while encoding static and semi-static information, such as radio nodes, cell attributes, and their topology, through graph representations; (ii) applies domain randomization to broaden the training distribution; and (iii) distributes data generation across multiple actors while centralizing training in a cloud-compatible architecture aligned with O-RAN principles. Although generalization increases computational and data-management complexity, our distributed design mitigates this by scaling data collection and training across diverse network conditions. Applied to downlink link adaptation in five 5G benchmarks, our policy improves average throughput and spectral efficiency by ~10% over an OLLA baseline (10% BLER target) in full-buffer MIMO/mMIMO and by >20% under high mobility. It matches specialized RL in full-buffer traffic and achieves up to 4- and 2-fold gains in eMBB and mixed-traffic benchmarks, respectively. In nine-cell deployments, GAT models offer 30% higher throughput over MLP baselines. These results, combined with our scalable architecture, offer a path toward AI-native 6G RAN using a single, generalizable RL agent.

NIDec 14, 2020
QoS Aware Robot Trajectory Optimization with IRS-Assisted Millimeter-Wave Communications

Cristian Tatino, Nikolaos Pappas, Di Yuan

In this paper, we consider the motion energy minimization problem for a robot that uses millimeter-wave (mm-wave) communications assisted by an intelligent reflective surface (IRS). The robot must perform tasks within given deadlines and it is subject to uplink quality of service (QoS) constraints. This problem is crucial for fully automated factories that are governed by the binomial of autonomous robots and new generations of mobile communications, i.e., 5G and 6G. In this new context, robot energy efficiency and communication reliability remain fundamental problems that couple in optimizing robot trajectory and communication QoS. More precisely, to account for the mutual dependency between robot position and communication QoS, robot trajectory and beamforming at the IRS and access point all need to be optimized. We present a solution that can decouple the two problems by exploiting mm-wave channel characteristics. Then, a closed-form solution is obtained for the beamforming optimization problem, whereas the trajectory is optimized by a novel successive-convex optimization-based algorithm that can deal with abrupt line-of-sight (LOS) to non-line-of-sight (NLOS) transitions. Specifically, the algorithm uses a radio map to avoid collisions with obstacles and poorly covered areas. We prove that the algorithm can converge to a solution satisfying the Karush-Kuhn-Tucker conditions. The simulation results show a fast convergence rate of the algorithm and a dramatic reduction of the motion energy consumption with respect to methods that aim to find maximum-rate trajectories. Moreover, we show that the use of passive IRSs represents a powerful solution to improve the radio coverage and motion energy efficiency of robots.

NIMar 21, 2020
Multi-Robot Association-Path Planning in Millimeter-Wave Industrial Scenarios

Cristian Tatino, Nikolaos Pappas, Di Yuan

The massive exploitation of robots for industry 4.0 needs advanced wireless solutions that replace less flexible and more costly wired networks. In this regard, millimeter-waves (mm-waves) can provide high data rates, but they are characterized by a spotty coverage requiring dense radio deployments. In such scenarios, coverage holes and numerous handovers may decrease the communication throughput and reliability. In contrast to conventional multi-robot path planning (MPP), we define a type of multi-robot association-path planning (MAPP) problems aiming to jointly optimize the robots' paths and the robots-access points (APs) associations. In MAPP, we focus on minimizing the path lengths as well as the number of handovers while sustaining connectivity. We propose an algorithm that can solve MAPP in polynomial time and it is able to numerically approach the global optimum. We show that the proposed solution is able to guarantee network connectivity and to dramatically reduce the number of handovers in comparison to minimizing only the path lengths.

SPMar 2, 2020
Learning-Based Link Scheduling in Millimeter-wave Multi-connectivity Scenarios

Cristian Tatino, Nikolaos Pappas, Ilaria Malanchini et al.

Multi-connectivity is emerging as a promising solution to provide reliable communications and seamless connectivity for the millimeter-wave frequency range. Due to the blockage sensitivity at such high frequencies, connectivity with multiple cells can drastically increase the network performance in terms of throughput and reliability. However, an inefficient link scheduling, i.e., over and under-provisioning of connections, can lead either to high interference and energy consumption or to unsatisfied user's quality of service (QoS) requirements. In this work, we present a learning-based solution that is able to learn and then to predict the optimal link scheduling to satisfy users' QoS requirements while avoiding communication interruptions. Moreover, we compare the proposed approach with two base line methods and the genie-aided link scheduling that assumes perfect channel knowledge. We show that the learning-based solution approaches the optimum and outperforms the base line methods.