CLAICECPFeb 4, 2022

StonkBERT: Can Language Models Predict Medium-Run Stock Price Movements?

arXiv:2202.02268v12 citations
Originality Synthesis-oriented
AI Analysis

This work addresses stock market prediction for investors using language models, but it is incremental as it applies existing methods to new data sources.

The authors tackled the problem of predicting one-year stock price movements by fine-tuning transformer language models on company-related text data, achieving substantial improvements in predictive accuracy and above-average stock market returns, with news articles yielding the highest performance.

To answer this question, we fine-tune transformer-based language models, including BERT, on different sources of company-related text data for a classification task to predict the one-year stock price performance. We use three different types of text data: News articles, blogs, and annual reports. This allows us to analyze to what extent the performance of language models is dependent on the type of the underlying document. StonkBERT, our transformer-based stock performance classifier, shows substantial improvement in predictive accuracy compared to traditional language models. The highest performance was achieved with news articles as text source. Performance simulations indicate that these improvements in classification accuracy also translate into above-average stock market returns.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes