A New Error Temporal Difference Algorithm for Deep Reinforcement Learning in Microgrid Optimization
This addresses uncertainty issues in microgrid control for energy management, but it is incremental as it builds on existing DRL methods.
The paper tackled uncertainty in deep reinforcement learning for microgrid optimization by proposing a new error temporal difference algorithm, which improved performance in simulations on a real-world dataset.
Predictive control approaches based on deep reinforcement learning (DRL) have gained significant attention in microgrid energy optimization. However, existing research often overlooks the issue of uncertainty stemming from imperfect prediction models, which can lead to suboptimal control strategies. This paper presents a new error temporal difference (ETD) algorithm for DRL to address the uncertainty in predictions,aiming to improve the performance of microgrid operations. First,a microgrid system integrated with renewable energy sources (RES) and energy storage systems (ESS), along with its Markov decision process (MDP), is modelled. Second, a predictive control approach based on a deep Q network (DQN) is presented, in which a weighted average algorithm and a new ETD algorithm are designed to quantify and address the prediction uncertainty, respectively. Finally, simulations on a realworld US dataset suggest that the developed ETD effectively improves the performance of DRL in optimizing microgrid operations.