IR AIMay 23, 2025

BehaveGPT: A Foundation Model for Large-scale User Behavior Modeling

Jiahui Gong, Jingtao Ding, Fanjin Meng, Chen Yang, Hong Chen, Zuojian Wang, Haisheng Lu, Yong Li

arXiv:2505.17631v12 citationsh-index: 24

Originality Incremental advance

AI Analysis

This work addresses the problem of large-scale user behavior prediction for applications like recommendation systems, though it appears incremental as it adapts transformer-based methods to a specific domain.

The paper tackles the challenge of modeling complex user behavior by proposing BehaveGPT, a foundation model that achieves over 10% improvement in macro and weighted recall compared to state-of-the-art baselines on real-world datasets.

In recent years, foundational models have revolutionized the fields of language and vision, demonstrating remarkable abilities in understanding and generating complex data; however, similar advances in user behavior modeling have been limited, largely due to the complexity of behavioral data and the challenges involved in capturing intricate temporal and contextual relationships in user activities. To address this, we propose BehaveGPT, a foundational model designed specifically for large-scale user behavior prediction. Leveraging transformer-based architecture and a novel pretraining paradigm, BehaveGPT is trained on vast user behavior datasets, allowing it to learn complex behavior patterns and support a range of downstream tasks, including next behavior prediction, long-term generation, and cross-domain adaptation. Our approach introduces the DRO-based pretraining paradigm tailored for user behavior data, which improves model generalization and transferability by equitably modeling both head and tail behaviors. Extensive experiments on real-world datasets demonstrate that BehaveGPT outperforms state-of-the-art baselines, achieving more than a 10% improvement in macro and weighted recall, showcasing its ability to effectively capture and predict user behavior. Furthermore, we measure the scaling law in the user behavior domain for the first time on the Honor dataset, providing insights into how model performance scales with increased data and parameter sizes.

View on arXiv PDF

Similar