CLVAE: A Variational Autoencoder for Long-Term Customer Revenue Forecasting

Jeffrey Näf, Riana Valera Mbelson, Markus Meierer

arXiv:2604.2263610.5

AI Analysis

For businesses in non-contractual settings, this model improves the efficiency of marketing resource allocation by providing more accurate long-term customer revenue forecasts.

The paper proposes a variational autoencoder (CLVAE) that combines a process-based likelihood for customer attrition, transactions, and spending with a flexible latent representation, improving long-term revenue forecasting over existing benchmarks across multiple real-world datasets.

Predicting customers' long-term revenue from sparse and irregular transaction data is central to marketing resource allocation in non-contractual settings, yet existing approaches face a trade-off. Traditional probabilistic customer base models deliver robust long-horizon forecasts by imposing strong structural assumptions, while flexible machine-learning models often require substantial training data and careful tuning. We propose a variational-autoencoder-based model that preserves the process-based likelihood of established attrition-transaction-spend models conditional on customer heterogeneity, but replaces the restrictive parametric mixing distribution with a flexible latent representation learned by encoder-decoder networks. The resulting approach (i) provides a single model for customer attrition, transactions and spending, (ii) remains reliable when contextual covariates are unavailable, and (iii) flexibly incorporates rich covariates and nonlinear effects when they are available. This design balances structural stability with the flexibility needed to capture complex purchase dynamics. Across multiple real-world datasets and prediction horizons, the proposed model improves upon the latest benchmarks. Businesses benefit directly, as a better assessment of customers' future revenues improves the efficiency of campaign targeting. For research, this work provides guidance on how to embed domain-specific models into the variational autoencoder framework, enabling flexible representation learning while retaining an econometrically meaningful process structure.

View on arXiv PDF

Similar