LGSIMLJul 16, 2019

DeepTrax: Embedding Graphs of Financial Transactions

arXiv:1907.07225v153 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of analyzing financial transaction graphs for institutions, though it is incremental as it applies existing graph embedding methods to a new domain.

The authors tackled the problem of applying machine learning to large, sparse financial transaction graphs by learning embeddings for account and merchant entities, achieving high effectiveness as measured by link prediction AUC and F1 scores.

Financial transactions can be considered edges in a heterogeneous graph between entities sending money and entities receiving money. For financial institutions, such a graph is likely large (with millions or billions of edges) while also sparsely connected. It becomes challenging to apply machine learning to such large and sparse graphs. Graph representation learning seeks to embed the nodes of a graph into a Euclidean vector space such that graph topological properties are preserved after the transformation. In this paper, we present a novel application of representation learning to bipartite graphs of credit card transactions in order to learn embeddings of account and merchant entities. Our framework is inspired by popular approaches in graph embeddings and is trained on two internal transaction datasets. This approach yields highly effective embeddings, as quantified by link prediction AUC and F1 score. Further, the resulting entity vectors retain intuitive semantic similarity that is explored through visualizations and other qualitative analyses. Finally, we show how these embeddings can be used as features in downstream machine learning business applications such as fraud detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes