CRLGSIJan 25, 2024

"All of Me": Mining Users' Attributes from their Public Spotify Playlists

arXiv:2401.14296v18 citationsWWW
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of user profiling for platforms and advertisers by leveraging public playlists, though it is incremental as it applies existing methods like DeepSets to a new domain.

The paper tackled the problem of predicting user attributes from public Spotify playlists by analyzing a dataset of 10,286 playlists from 739 users, finding statistical associations such as high openness correlating with diverse artists and female users preferring Pop and K-pop, and developed a DeepSet model that outperformed baselines in most attributes.

In the age of digital music streaming, playlists on platforms like Spotify have become an integral part of individuals' musical experiences. People create and publicly share their own playlists to express their musical tastes, promote the discovery of their favorite artists, and foster social connections. These publicly accessible playlists transcend the boundaries of mere musical preferences: they serve as sources of rich insights into users' attributes and identities. For example, the musical preferences of elderly individuals may lean more towards Frank Sinatra, while Billie Eilish remains a favored choice among teenagers. These playlists thus become windows into the diverse and evolving facets of one's musical identity. In this work, we investigate the relationship between Spotify users' attributes and their public playlists. In particular, we focus on identifying recurring musical characteristics associated with users' individual attributes, such as demographics, habits, or personality traits. To this end, we conducted an online survey involving 739 Spotify users, yielding a dataset of 10,286 publicly shared playlists encompassing over 200,000 unique songs and 55,000 artists. Through extensive statistical analyses, we first assess a deep connection between a user's Spotify playlists and their real-life attributes. For instance, we found individuals high in openness often create playlists featuring a diverse array of artists, while female users prefer Pop and K-pop music genres. Building upon these observed associations, we create accurate predictive models for users' attributes, presenting a novel DeepSet application that outperforms baselines in most of these users' attributes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes