CRLGMar 25, 2024

Machine Learning on Blockchain Data: A Systematic Mapping Study

arXiv:2403.17081v113 citationsh-index: 14Journal of the Association for Information Science and Technology
Originality Synthesis-oriented
AI Analysis

This work provides a comprehensive review for researchers and practitioners in blockchain and machine learning, identifying gaps for future research, but it is incremental as it synthesizes existing literature without new methods or data.

The paper conducted a systematic mapping study to review the state of the art on machine learning applied to blockchain data, analyzing 159 articles and finding that 49.7% focus on anomaly detection, 47.2% use Bitcoin data, and 46.5% apply classification tasks.

Context: Blockchain technology has drawn growing attention in the literature and in practice. Blockchain technology generates considerable amounts of data and has thus been a topic of interest for Machine Learning (ML). Objective: The objective of this paper is to provide a comprehensive review of the state of the art on machine learning applied to blockchain data. This work aims to systematically identify, analyze, and classify the literature on ML applied to blockchain data. This will allow us to discover the fields where more effort should be placed in future research. Method: A systematic mapping study has been conducted to identify the relevant literature. Ultimately, 159 articles were selected and classified according to various dimensions, specifically, the domain use case, the blockchain, the data, and the machine learning models. Results: The majority of the papers (49.7%) fall within the Anomaly use case. Bitcoin (47.2%) was the blockchain that drew the most attention. A dataset consisting of more than 1.000.000 data points was used by 31.4% of the papers. And Classification (46.5%) was the ML task most applied to blockchain data. Conclusion: The results confirm that ML applied to blockchain data is a relevant and a growing topic of interest both in the literature and in practice. Nevertheless, some open challenges and gaps remain, which can lead to future research directions. Specifically, we identify novel machine learning algorithms, the lack of a standardization framework, blockchain scalability issues and cross-chain interactions as areas worth exploring in the future.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes