IRLGSEMar 12, 2023

MobileRec: A Large-Scale Dataset for Mobile Apps Recommendation

arXiv:2303.06588v115 citationsh-index: 37Has Code
Originality Synthesis-oriented
AI Analysis

This dataset addresses a bottleneck for researchers in mobile app recommendation by providing a new benchmark, though it is incremental as it extends existing dataset creation efforts to a new domain.

The authors tackled the lack of high-quality benchmark datasets for mobile app recommendation systems by introducing MobileRec, a large-scale dataset with 19.3 million user interactions, 10K unique apps, and 0.7 million distinct users, and demonstrated its utility as a testbed through comparative studies of state-of-the-art methods.

Recommender systems have become ubiquitous in our digital lives, from recommending products on e-commerce websites to suggesting movies and music on streaming platforms. Existing recommendation datasets, such as Amazon Product Reviews and MovieLens, greatly facilitated the research and development of recommender systems in their respective domains. While the number of mobile users and applications (aka apps) has increased exponentially over the past decade, research in mobile app recommender systems has been significantly constrained, primarily due to the lack of high-quality benchmark datasets, as opposed to recommendations for products, movies, and news. To facilitate research for app recommendation systems, we introduce a large-scale dataset, called MobileRec. We constructed MobileRec from users' activity on the Google play store. MobileRec contains 19.3 million user interactions (i.e., user reviews on apps) with over 10K unique apps across 48 categories. MobileRec records the sequential activity of a total of 0.7 million distinct users. Each of these users has interacted with no fewer than five distinct apps, which stands in contrast to previous datasets on mobile apps that recorded only a single interaction per user. Furthermore, MobileRec presents users' ratings as well as sentiments on installed apps, and each app contains rich metadata such as app name, category, description, and overall rating, among others. We demonstrate that MobileRec can serve as an excellent testbed for app recommendation through a comparative study of several state-of-the-art recommendation approaches. The quantitative results can act as a baseline for other researchers to compare their results against. The MobileRec dataset is available at https://huggingface.co/datasets/recmeapp/mobilerec.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes