Riccardo Trivisonno

NI
4papers
Novelty49%
AI Score42

4 Papers

3.5NIMay 27
A Goal-Oriented Networking Approach for Intelligent IoT Service Deployment

Federico Tonini, Davide Borsatti, Wint Yi Poe et al.

The first 6G standardization efforts are about to start, shaping the new generation of mobile networks. The IMT-2030 extends the IMT-2020 by expanding its usage scenarios to Immersive, Massive, and Hyper-Reliable and Low-Latency Communications. It also introduces novel scenarios by integrating Artificial Intelligence and Sensing with Communication and supporting Ubiquitous Connectivity. Compared to the previous generation, 6G is expected to improve not only throughput and latency, but also coverage and energy efficiency. A paradigm called Goal-Oriented (GO) communications has recently emerged as a promising solution to improve network efficiency. It relies on the fact that the goal of the communication network is to achieve a specific task with a defined accuracy, rather than creating perfect data delivery. Intelligent devices can pre-process data to send only what is relevant to achieve the task, thus saving precious network resources and energy. Recent works demonstrate that incorporating service- and application-level KPIs in the network allows to achieve higher communication efficiency for devices, but the consequence of using such techniques on the network itself has not yet been explored. This paper proposes a practical end-to-end framework to assess energy consumption, latency, and goal accuracy KPIs, which includes a Multi-Objective optimization model to evaluate the trade-offs between the multiple KPIs relevant to GO networking. We demonstrate, through simulation, that the network can benefit from the application of the GO paradigm, indicating its potential in future network architectures.

85.9LGMay 20
Memory-R2: Fair Credit Assignment for Long-Horizon Memory-Augmented LLM Agents

Sikuan Yan, Ahmed Bahloul, Ercong Nie et al.

Memory-augmented LLM agents enable interactions that extend beyond finite context windows by storing, updating, and reusing information across sessions. However, training such agents with reinforcement learning in multi-session environments is challenging because memory turns the agent's past actions into part of its future environment. Once different rollouts write, update, or delete different memories, they no longer share the same intermediate memory state, making trajectory-level comparisons fundamentally unfair. This violates a key assumption behind group-relative methods such as GRPO, where rollouts are compared as if they were sampled from the same effective environment. Consequently, trajectory-level rewards provide noisy or biased credit signals for long-horizon memory operations. To address this challenge, we introduce Memory-R2, a training framework for long-horizon memory-augmented LLM agents. Its core algorithm, LoGo-GRPO, combines local and global group-relative optimization. The global objective preserves end-to-end learning from long-horizon trajectory-level rewards, while local rerollouts compare different memory-operation outcomes from the same intermediate memory state, yielding fairer group comparisons and more precise supervision for memory construction. Beyond credit assignment, Memory-R2 jointly optimizes memory formation and memory evolution with a shared-parameter co-learning design, where a fact extractor and a memory manager are instantiated from the same LLM backbone through role-specific prompts. To stabilize multi-step RL over long memory horizons, we adopt a progressive curriculum that increases the training horizon from 8 to 16 to 32 sessions. Together, these components provide an effective training paradigm for memory-augmented LLM agents in long-horizon multi-session settings.

89.7MAMay 16
PyraVid: Hierarchical Multimodal Memory for Long-Horizon Video Reasoning

Sikuan Yan, Sicheng Dong, Haotong Wang et al.

Memory has become an increasingly important component of agentic systems, as these systems are expected to reason over long-term experience. However, prior work has largely focused on unimodal memory, leaving multimodal memory relatively underexplored despite its central role in real-world applications. Compared with unimodal settings, multimodal memory introduces additional challenges, including heterogeneous input integration, person-centric information alignment, and evidence aggregation across different granularities. We present PyraVid, a hierarchical multimodal memory framework inspired by Event Segmentation Theory from cognitive science. PyraVid organizes long videos into a coarse-to-fine pyramid structure, enabling structured memory access and effective evidence aggregation. It further supports structure-guided memory expansion with pruning, allowing the retrieval of related events with strong causal connectivity but low semantic similarity while reducing noise. Experiments on multiple long-video understanding benchmarks show that PyraVid consistently improves performance across datasets, model scales, and question types, highlighting the effectiveness of hierarchical multimodal memory for long-horizon reasoning.

NIJul 28, 2021
A Distributed Intelligence Architecture for B5G Network Automation

Sayantini Majumdar, Riccardo Trivisonno, Georg Carle

The management of networks is automated by closed loops. Concurrent closed loops aiming for individual optimization cause conflicts which, left unresolved, leads to significant degradation in performance indicators, resulting in sub-optimal network performance. Centralized optimization avoids conflicts, but impractical in large-scale networks for time-critical applications. Distributed, pervasive intelligence is therefore envisaged in the evolution to B5G networks. In this letter, we propose a Q-Learning-based distributed architecture (QLC), addressing the conflict issue by encouraging cooperation among intelligent agents. We design a realistic B5G network slice auto-scaling model and validate the performance of QLC via simulations, justifying further research in this direction.