An Intelligent eUPF for Time-Sensitive Path Selection in B5G Edge Networks
It addresses the need for intelligent traffic management in B5G edge networks to meet speed and reliability requirements.
The paper proposes an enhanced User Plane Function (eUPF) using a Deep Q-Network (DQN) for real-time path selection between MEC and cloud in B5G networks, achieving lower average latency and more stable rewards compared to a random baseline.
In Beyond 5G (B5G) networks, intelligent, flexible traffic management is essential to meet the stringent speed and reliability requirements of new applications. This paper presents an improved User Plane Function (eUPF) design that uses a Deep Q-Network (DQN) agent for real-time path selection between Multi-access Edge Computing (MEC) and cloud endpoints. The path selection problem is formulated as a Partially Observable Markov Decision Process (POMDP). We propose a novel passive delay measurement method that uses eBPF programs to link TEID-based timestamps in GTP-U traffic, allowing for low-cost delay estimation without active testing. Experiments show that the DQN agent substantially outperforms a random baseline, with lower average latency, more stable rewards, and more reliable low-delay path choices. These results demonstrate the effectiveness of AI-driven control in B5G core networks and the promise of reinforcement learning for modern network management.