LGAug 7, 2025

Cold-Start Active Preference Learning in Socio-Economic Domains

arXiv:2508.05090v2h-index: 1Expert syst appl
Originality Incremental advance
AI Analysis

This addresses a practical issue for researchers and practitioners in socio-economic domains, offering an incremental improvement over existing methods.

The paper tackles the cold-start problem in active preference learning, where performance drops without initial labeled data, by proposing a self-supervised PCA method to generate pseudo-labels, which outperforms standard strategies on socio-economic datasets like financial credibility and career success rate.

Active preference learning offers an efficient approach to modeling preferences, but it is hindered by the cold-start problem, which leads to a marked decline in performance when no initial labeled data are available. While cold-start solutions have been proposed for domains such as vision and text, the cold-start problem in active preference learning remains largely unexplored, underscoring the need for practical, effective methods. Drawing inspiration from established practices in social and economic research, the proposed method initiates learning with a self-supervised phase that employs Principal Component Analysis (PCA) to generate initial pseudo-labels. This process produces a \say{warmed-up} model based solely on the data's intrinsic structure, without requiring expert input. The model is then refined through an active learning loop that strategically queries a simulated noisy oracle for labels. Experiments conducted on various socio-economic datasets, including those related to financial credibility, career success rate, and socio-economic status, consistently show that the PCA-driven approach outperforms standard active learning strategies that start without prior information. This work thus provides a computationally efficient and straightforward solution that effectively addresses the cold-start problem.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes