LGAIOct 28, 2021

Generating synthetic transactional profiles

arXiv:2111.01531v2
Originality Synthesis-oriented
AI Analysis

This work addresses privacy concerns for financial institutions by creating synthetic data that mimics real transactional behavior, though it appears incremental in applying existing techniques to a specific domain.

The paper tackled the problem of generating synthetic transactional profiles for financial applications, aiming to preserve both data utility and privacy, and found that neural network models can produce valuable synthetic data in this context.

Financial institutions use clients' payment transactions in numerous banking applications. Transactions are very personal and rich in behavioural patterns, often unique to individuals, which make them equivalent to personally identifiable information in some cases. In this paper, we generate synthetic transactional profiles using machine learning techniques with the goal to preserve both data utility and privacy. A challenge we faced was to deal with sparse vectors due to the few spending categories a client uses compared to all the ones available. We measured data utility by calculating common insights used by the banking industry on both the original and the synthetic data-set. Our approach shows that neural network models can generate valuable synthetic data in such context. Finally, we tried privacy-preserving techniques and observed its effect on models' performances.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes