LG AIJan 29

Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement

Maria Despoina Siampou, Shushman Choudhury, Shang-Ling Hsu, Neha Arora, Cyrus Shahabi

arXiv:2601.21149v23.82 citationsh-index: 10

Originality Incremental advance

AI Analysis

This work addresses the need for more accurate and generalizable POI representations in geospatial AI, which is incremental by building on existing language and mobility models to capture missing usage signals.

The paper tackles the problem of learning general-purpose representations for points-of-interest (POIs) by incorporating POI function from human mobility data, rather than relying solely on static textual metadata or trajectory context. The result shows that augmenting text-based embeddings with their Mobility-Embedded POIs (ME-POIs) framework consistently outperforms baselines across five map enrichment tasks, with ME-POIs trained on mobility alone sometimes surpassing text-only models.

Recent progress in geospatial foundation models highlights the importance of learning general-purpose representations for real-world locations, particularly points-of-interest (POIs) where human activity concentrates. Existing approaches, however, focus primarily on place identity derived from static textual metadata, or learn representations tied to trajectory context, which capture movement regularities rather than how places are actually used (i.e., POI's function). We argue that POI function is a missing but essential signal for general POI representations. We introduce Mobility-Embedded POIs (ME-POIs), a framework that augments POI embeddings derived, from language models with large-scale human mobility data to learn POI-centric, context-independent representations grounded in real-world usage. ME-POIs encodes individual visits as temporally contextualized embeddings and aligns them with learnable POI representations via contrastive learning to capture usage patterns across users and time. To address long-tail sparsity, we propose a novel mechanism that propagates temporal visit patterns from nearby, frequently visited POIs across multiple spatial scales. We evaluate ME-POIs on five newly proposed map enrichment tasks, testing its ability to capture both the identity and function of POIs. Across all tasks, augmenting text-based embeddings with ME-POIs consistently outperforms both text-only and mobility-only baselines. Notably, ME-POIs trained on mobility data alone can surpass text-only models on certain tasks, highlighting that POI function is a critical component of accurate and generalizable POI representations.

View on arXiv PDF

Similar