TR AI LGJan 14, 2023

PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets

arXiv:2302.00586v25.910 citationsh-index: 40

Originality Incremental advance

AI Analysis

This addresses the need for more comprehensive evaluation metrics for financial practitioners to deploy reinforcement learning methods in real-world markets, though it is incremental in building on existing FinRL research.

The paper tackles the problem of evaluating reinforcement learning methods in financial markets by introducing PRUDEX-Compass, a framework with 6 axes and 17 measures for systematic assessment, and demonstrates its usage by evaluating 8 methods on 4 real-world datasets, releasing public resources including datasets and implementations.

The financial markets, which involve more than $90 trillion market capitals, attract the attention of innumerable investors around the world. Recently, reinforcement learning in financial markets (FinRL) has emerged as a promising direction to train agents for making profitable investment decisions. However, the evaluation of most FinRL methods only focuses on profit-related measures and ignores many critical axes, which are far from satisfactory for financial practitioners to deploy these methods into real-world financial markets. Therefore, we introduce PRUDEX-Compass, which has 6 axes, i.e., Profitability, Risk-control, Universality, Diversity, rEliability, and eXplainability, with a total of 17 measures for a systematic evaluation. Specifically, i) we propose AlphaMix+ as a strong FinRL baseline, which leverages mixture-of-experts (MoE) and risk-sensitive approaches to make diversified risk-aware investment decisions, ii) we evaluate 8 FinRL methods in 4 long-term real-world datasets of influential financial markets to demonstrate the usage of our PRUDEX-Compass, iii) PRUDEX-Compass together with 4 real-world datasets, standard implementation of 8 FinRL methods and a portfolio management environment is released as public resources to facilitate the design and comparison of new FinRL methods. We hope that PRUDEX-Compass can not only shed light on future FinRL research to prevent untrustworthy results from stagnating FinRL into successful industry deployment but also provide a new challenging algorithm evaluation scenario for the reinforcement learning (RL) community.

View on arXiv PDF

Similar