CrossVIT-augmented Geospatial-Intelligence Visualization System for Tracking Economic Development Dynamics
This work provides an efficient tool for policymakers and researchers to improve resource allocation and economic planning, though it is incremental as it builds on existing Transformer and multimodal sensing methods.
The paper tackled the problem of tracking economic dynamics by developing Senseconomic, a scalable system that integrates remote sensing and street view images using cross-attention with nighttime light data as weak supervision, achieving an R-squared value of 0.8363 in county-level economic predictions and reducing processing time to 23 minutes.
Timely and accurate economic data is crucial for effective policymaking. Current challenges in data timeliness and spatial resolution can be addressed with advancements in multimodal sensing and distributed computing. We introduce Senseconomic, a scalable system for tracking economic dynamics via multimodal imagery and deep learning. Built on the Transformer framework, it integrates remote sensing and street view images using cross-attention, with nighttime light data as weak supervision. The system achieved an R-squared value of 0.8363 in county-level economic predictions and halved processing time to 23 minutes using distributed computing. Its user-friendly design includes a Vue3-based front end with Baidu maps for visualization and a Python-based back end automating tasks like image downloads and preprocessing. Senseconomic empowers policymakers and researchers with efficient tools for resource allocation and economic planning.