TR LGMay 14, 2025

An Efficient deep learning model to Predict Stock Price Movement Based on Limit Order Book

Jiahao Yang, Ran Fang, Ming Zhang, Jun Zhou

arXiv:2505.22678v11.2h-index: 5

Originality Synthesis-oriented

AI Analysis

This work addresses a domain-specific problem for high-frequency traders by offering an incremental improvement in modeling stock price movements.

The paper tackles the challenge of predicting stock price movements from high-dimensional limit order book data by proposing a Siamese architecture that processes ask and bid sides separately with shared parameters, improving performance in over 75% of cases compared to strong baselines.

In high-frequency trading (HFT), leveraging limit order books (LOB) to model stock price movements is crucial for achieving profitable outcomes. However, this task is challenging due to the high-dimensional and volatile nature of the original data. Even recent deep learning models often struggle to capture price movement patterns effectively, particularly without well-designed features. We observed that raw LOB data exhibits inherent symmetry between the ask and bid sides, and the bid-ask differences demonstrate greater stability and lower complexity compared to the original data. Building on this insight, we propose a novel approach in which leverages the Siamese architecture to enhance the performance of existing deep learning models. The core idea involves processing the ask and bid sides separately using the same module with shared parameters. We applied our Siamese-based methods to several widely used strong baselines and validated their effectiveness using data from 14 military industry stocks in the Chinese A-share market. Furthermore, we integrated multi-head attention (MHA) mechanisms with the Long Short-Term Memory (LSTM) module to investigate its role in modeling stock price movements. Our experiments used raw data and widely used Order Flow Imbalance (OFI) features as input with some strong baseline models. The results show that our method improves the performance of strong baselines in over 75$% of cases, excluding the Multi-Layer Perception (MLP) baseline, which performed poorly and is not considered practical. Furthermore, we found that Multi-Head Attention can enhance model performance, particularly over shorter forecasting horizons.

View on arXiv PDF

Similar