LG AIJun 5, 2024

Efficient User Sequence Learning for Online Services via Compressed Graph Neural Networks

Yucheng Wu, Liyue Chen, Yu Cheng, Shuai Chen, Jinyu Xu, Leye Wang

arXiv:2406.02979v12.6Has Code

Originality Incremental advance

AI Analysis

This work addresses efficiency challenges for online services like fraud detection, though it is incremental as it builds on existing GNN and compression methods.

The paper tackles the computational inefficiency of applying Graph Neural Networks (GNNs) to large-scale user behavior sequences in online services by proposing ECSeq, a framework that integrates graph compression techniques, resulting in improved prediction accuracy by ~5% with minimal additional training time and preserved inference speed.

Learning representations of user behavior sequences is crucial for various online services, such as online fraudulent transaction detection mechanisms. Graph Neural Networks (GNNs) have been extensively applied to model sequence relationships, and extract information from similar sequences. While user behavior sequence data volume is usually huge for online applications, directly applying GNN models may lead to substantial computational overhead during both the training and inference stages and make it challenging to meet real-time requirements for online services. In this paper, we leverage graph compression techniques to alleviate the efficiency issue. Specifically, we propose a novel unified framework called ECSeq, to introduce graph compression techniques into relation modeling for user sequence representation learning. The key module of ECSeq is sequence relation modeling, which explores relationships among sequences to enhance sequence representation learning, and employs graph compression algorithms to achieve high efficiency and scalability. ECSeq also exhibits plug-and-play characteristics, seamlessly augmenting pre-trained sequence representation models without modifications. Empirical experiments on both sequence classification and regression tasks demonstrate the effectiveness of ECSeq. Specifically, with an additional training time of tens of seconds in total on 100,000+ sequences and inference time preserved within $10^{-4}$ seconds/sample, ECSeq improves the prediction R@P$_{0.9}$ of the widely used LSTM by $\sim 5\%$.

View on arXiv PDF Code

Similar