SIJun 1

Enhancing the Socioeconomic Understanding of Foundation Models with Urban Mobility

Baoshen Guo, Donghang Li, Zhiqing Hong, Kailai Sun, Heye Huang, Alok Prakash, Shenhao Wang

arXiv:2606.0174566.7

AI Analysis

For urban analytics researchers, this work addresses the limitation of static place attributes by incorporating dynamic human mobility, though it is an incremental extension of existing foundation model fusion techniques.

The paper proposes MobFusion, a paradigm that integrates mobility networks into foundation models for urban socioeconomic prediction, achieving improvements in tasks like income, density, and crime prediction across three U.S. cities.

Foundation models have recently been applied to urban socioeconomic prediction using POI text, satellite imagery, and geospatial descriptions. However, these models mostly rely on static attributes of individual places, while ignoring the mobility patterns that reveal how places are functionally connected. To address this gap, we explore whether mobility networks can elicit the geospatial capabilities of foundation models by explicitly encoding connectivity among urban entities. We propose \textit{MobFusion}, a modular mobility-enhanced foundation model fusion paradigm, and instantiate it through three complementary designs: (i) mobility networks as contexts for zero-shot LLM prompting, (ii) as graph connectors for fusing geospatial visual embeddings with textual embeddings, and (iii) as structured tokens for multimodal LLM reasoning. Using anonymized large-scale mobility datasets from three U.S. metropolitan areas, we find that \textit{MobFusion} improves urban prediction tasks (e.g., median household income, population density, and crime prediction) across three instantiations, demonstrating that incorporating human mobility can effectively improve the socioeconomic understanding of foundation models.

View on arXiv PDF

Similar