LGTRMLDec 16, 2021

Trading with the Momentum Transformer: An Intelligent and Interpretable Architecture

arXiv:2112.08534v327 citations
Originality Incremental advance
AI Analysis

This work addresses trading strategy design for financial markets, presenting an incremental improvement by combining attention mechanisms with LSTMs for better interpretability and performance.

The paper tackles the problem of time-series trading by introducing the Momentum Transformer, an attention-based deep-learning architecture that outperforms benchmark momentum and mean-reversion strategies, with improved performance net of transaction costs and adaptability to new market regimes like the SARS-CoV-2 crisis.

We introduce the Momentum Transformer, an attention-based deep-learning architecture, which outperforms benchmark time-series momentum and mean-reversion trading strategies. Unlike state-of-the-art Long Short-Term Memory (LSTM) architectures, which are sequential in nature and tailored to local processing, an attention mechanism provides our architecture with a direct connection to all previous time-steps. Our architecture, an attention-LSTM hybrid, enables us to learn longer-term dependencies, improves performance when considering returns net of transaction costs and naturally adapts to new market regimes, such as during the SARS-CoV-2 crisis. Via the introduction of multiple attention heads, we can capture concurrent regimes, or temporal dynamics, which are occurring at different timescales. The Momentum Transformer is inherently interpretable, providing us with greater insights into our deep-learning momentum trading strategy, including the importance of different factors over time and the past time-steps which are of the greatest significance to the model.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes