LGCYMay 7, 2025

Spatio-Temporal Graph Neural Network for Urban Spaces: Interpolating Citywide Traffic Volume

arXiv:2505.06292v15 citationsh-index: 5Has CodeExpert syst appl
Originality Incremental advance
AI Analysis

This work addresses the challenge of sparse traffic data for urban planners and traffic managers, offering a practical interpolation method that is incremental but tailored to urban-specific issues like zero-inflated distributions.

The paper tackles the problem of estimating street-level traffic volumes at unobserved locations in urban settings, where sensor data is sparse, by introducing the Graph Neural Network for Urban Interpolation (GNNUI), which outperforms existing methods across metrics like MAE (e.g., from 7.1 to 10.5 on Strava data) and remains robust even with as low as 1% sensor coverage.

Reliable street-level traffic volume data, covering multiple modes of transportation, helps urban planning by informing decisions on infrastructure improvements, traffic management, and public transportation. Yet, traffic sensors measuring traffic volume are typically scarcely located, due to their high deployment and maintenance costs. To address this, interpolation methods can estimate traffic volumes at unobserved locations using available data. Graph Neural Networks have shown strong performance in traffic volume forecasting, particularly on highways and major arterial networks. Applying them to urban settings, however, presents unique challenges: urban networks exhibit greater structural diversity, traffic volumes are highly overdispersed with many zeros, the best way to account for spatial dependencies remains unclear, and sensor coverage is often very sparse. We introduce the Graph Neural Network for Urban Interpolation (GNNUI), a novel urban traffic volume estimation approach. GNNUI employs a masking algorithm to learn interpolation, integrates node features to capture functional roles, and uses a loss function tailored to zero-inflated traffic distributions. In addition to the model, we introduce two new open, large-scale urban traffic volume benchmarks, covering different transportation modes: Strava cycling data from Berlin and New York City taxi data. GNNUI outperforms recent, some graph-based, interpolation methods across metrics (MAE, RMSE, true-zero rate, Kullback-Leibler divergence) and remains robust from 90% to 1% sensor coverage. On Strava, for instance, MAE rises only from 7.1 to 10.5, on Taxi from 23.0 to 40.4, demonstrating strong performance under extreme data scarcity, common in real-world urban settings. We also examine how graph connectivity choices influence model accuracy.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes