LG · AI · TR · May 9, 2021

Reinforcement Learning with Expert Trajectory For Quantitative Trading

arXiv:2105.03844v1 · 5 citations
Originality: Incremental advance
AI Analysis

This paper addresses the challenge of long-term goals and delayed rewards in futures trading, a problem relevant to investors and researchers, though it appears incremental because it builds on existing reinforcement learning approaches.

The paper tackles the problem of quantitative trading by modeling price prediction as a Markov decision process and optimizing it with reinforcement learning using expert trajectories, showing advantages over existing methods on Chinese share price index futures like IF and IC.

In recent years, quantitative investment methods combined with artificial intelligence have attracted increasing attention from investors and researchers. Existing methods based on supervised learning are not well suited to learning problems with long-term goals and delayed rewards in real futures trading. In this paper, therefore, we model the price prediction problem as a Markov decision process (MDP) and optimize it by reinforcement learning with expert trajectories. In the proposed method, we describe the states of the MDP with more than 100 short-term alpha factors instead of the price, volume, and handful of technical factors used in existing methods. Furthermore, unlike DQN (deep Q-learning) and BC (behavior cloning) in related methods, we introduce expert experience in the training stage and consider both the expert-environment interaction and the agent-environment interaction when designing the temporal difference error, so that the agents are more adaptable to the inevitable noise in financial data. Experimental results on share price index futures in China, including IF (CSI 300) and IC (CSI 500), show the advantages of the proposed method over three typical technical analysis methods and two deep learning based methods.
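The abstract does not give the exact form of the temporal difference error, so the following is only a minimal sketch of the general idea of drawing TD errors from both agent-environment and expert-environment transitions. It assumes a tabular Q-function stored as a NumPy array, a standard one-step Q-learning target, and a simple weighted sum of the two losses. The names `td_errors`, `combined_td_loss`, and `expert_weight` are hypothetical and do not come from the paper.

```python
import numpy as np

def td_errors(q, q_target, batch, gamma=0.99):
    """One-step TD errors for a batch of transitions.

    batch is a tuple (s, a, r, s_next, done) of equal-length arrays, where
    s, a, s_next hold integer indices into the Q-table (an assumption made
    here for simplicity; the paper uses alpha-factor states).
    """
    s, a, r, s_next, done = batch
    # Bootstrapped target: r + gamma * max_a' Q_target(s', a'),
    # with no bootstrapping past terminal transitions.
    target = r + gamma * (1.0 - done) * q_target[s_next].max(axis=1)
    return target - q[s, a]

def combined_td_loss(q, q_target, agent_batch, expert_batch,
                     expert_weight=0.5, gamma=0.99):
    """Weighted mean squared TD error over agent and expert transitions.

    expert_weight is a hypothetical mixing coefficient, not a value
    taken from the paper.
    """
    agent_loss = np.mean(td_errors(q, q_target, agent_batch, gamma) ** 2)
    expert_loss = np.mean(td_errors(q, q_target, expert_batch, gamma) ** 2)
    return (1.0 - expert_weight) * agent_loss + expert_weight * expert_loss

# Toy usage: 2 states, 3 actions (for example short, flat, long).
q = np.zeros((2, 3))
q_target = np.zeros((2, 3))
agent_batch = (np.array([0]), np.array([1]), np.array([1.0]),
               np.array([1]), np.array([0.0]))
expert_batch = (np.array([0]), np.array([2]), np.array([2.0]),
                np.array([1]), np.array([0.0]))
loss = combined_td_loss(q, q_target, agent_batch, expert_batch)
```

Mixing the two losses lets expert transitions shape the value estimates even when the agent's own experience is noisy. How the paper actually weights or combines the two interaction types may differ from this sketch.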

