LG AI MLMar 8, 2018

A Multi-Objective Deep Reinforcement Learning Framework

Thanh Thi Nguyen, Ngoc Duy Nguyen, Peter Vamplew, Saeid Nahavandi, Richard Dazeley, Chee Peng Lim

arXiv:1803.02965v316.3146 citations

Originality Incremental advance

AI Analysis

It provides a generic and modular testbed platform to accelerate development for solving complex multi-objective problems, addressing limitations in existing methods.

The paper tackles the challenge of multi-objective deep reinforcement learning by introducing a scalable framework based on deep Q-networks, which effectively finds Pareto-optimal solutions in benchmark problems like the deep sea treasure environment and Mountain Car.

This paper introduces a new scalable multi-objective deep reinforcement learning (MODRL) framework based on deep Q-networks. We develop a high-performance MODRL framework that supports both single-policy and multi-policy strategies, as well as both linear and non-linear approaches to action selection. The experimental results on two benchmark problems (two-objective deep sea treasure environment and three-objective Mountain Car problem) indicate that the proposed framework is able to find the Pareto-optimal solutions effectively. The proposed framework is generic and highly modularized, which allows the integration of different deep reinforcement learning algorithms in different complex problem domains. This therefore overcomes many disadvantages involved with standard multi-objective reinforcement learning methods in the current literature. The proposed framework acts as a testbed platform that accelerates the development of MODRL for solving increasingly complicated multi-objective problems.

View on arXiv PDF

Similar