LG AI CP MLFeb 23, 2025

Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies

arXiv:2502.17518v14.11 citationsh-index: 1

Originality Synthesis-oriented

AI Analysis

This provides incremental improvements for financial trading applications by integrating existing RL and classifier methods.

This paper tackles the problem of improving risk-return trade-offs in financial trading strategies by combining ensemble Reinforcement Learning (RL) models with traditional classifiers, demonstrating that ensemble methods consistently outperform base models in terms of risk-adjusted returns with better drawdown management and stability.

This paper presents a comprehensive study on the use of ensemble Reinforcement Learning (RL) models in financial trading strategies, leveraging classifier models to enhance performance. By combining RL algorithms such as A2C, PPO, and SAC with traditional classifiers like Support Vector Machines (SVM), Decision Trees, and Logistic Regression, we investigate how different classifier groups can be integrated to improve risk-return trade-offs. The study evaluates the effectiveness of various ensemble methods, comparing them with individual RL models across key financial metrics, including Cumulative Returns, Sharpe Ratios (SR), Calmar Ratios, and Maximum Drawdown (MDD). Our results demonstrate that ensemble methods consistently outperform base models in terms of risk-adjusted returns, providing better management of drawdowns and overall stability. However, we identify the sensitivity of ensemble performance to the choice of variance threshold τ, highlighting the importance of dynamic τ adjustment to achieve optimal performance. This study emphasizes the value of combining RL with classifiers for adaptive decision-making, with implications for financial trading, robotics, and other dynamic environments.

View on arXiv PDF

Similar