LGFeb 29, 2024

A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations

Erhan Can Ozcan, Vittorio Giammarino, James Queeney, Ioannis Ch. Paschalidis

arXiv:2402.18836v12.63 citationsh-index: 41Has CodeCDC

Originality Incremental advance

AI Analysis

This work addresses sample efficiency for reinforcement learning practitioners, presenting an incremental improvement by combining existing methods.

The paper tackles the problem of improving sample efficiency in deep reinforcement learning by incorporating expert observations without explicit action information, achieving superior performance on continuous control tasks compared to benchmarks.

This paper investigates how to incorporate expert observations (without explicit information on expert actions) into a deep reinforcement learning setting to improve sample efficiency. First, we formulate an augmented policy loss combining a maximum entropy reinforcement learning objective with a behavioral cloning loss that leverages a forward dynamics model. Then, we propose an algorithm that automatically adjusts the weights of each component in the augmented loss function. Experiments on a variety of continuous control tasks demonstrate that the proposed algorithm outperforms various benchmarks by effectively utilizing available expert observations.

View on arXiv PDF Code

Similar