LGApr 9

Heterogeneous Graph Importance Scoring and Clustering with Automated LLM-based Interpretation

arXiv:2605.029192.7
Predicted impact top 84% in LG · last 90 daysOriginality Synthesis-oriented
AI Analysis

For urban infrastructure managers, this provides an automated, open-data pipeline to prioritize bridges for maintenance and disaster preparedness, though the method is incremental.

This paper develops a methodology to assess bridge importance using heterogeneous graph analysis and unsupervised clustering, achieving a 40x computational optimization and generating policy-relevant interpretations via LLMs. The approach is validated across multiple cities using only open data.

Urban bridge networks are critical infrastructure whose disruption can cascade into severe impacts on transportation, emergency services, and economic activity. This paper presents a comprehensive methodology for assessing bridge importance through heterogeneous graph analysis, unsupervised clustering, and automated interpretation via large language models (LLMs). Our approach addresses three fundamental challenges: (1) quantifying multi-dimensional bridge importance using only open data sources, (2) discovering functional bridge archetypes across different cities, and (3) generating policy-relevant interpretations automatically. We construct heterogeneous graphs from OpenStreetMap (OSM) data incorporating bridges, road networks, buildings, and public facilities. Five social impact indicators are computed: transit desert score, hospital access score, isolation risk score, supply chain impact score, and green space access score. These 52-dimensional feature vectors undergo dimensionality reduction via UMAP and density-based clustering via HDBSCAN. Discovered clusters are interpreted using temperature-optimized LLMs (Elyza8b, trained on construction domain corpus). (1) A complete open-data pipeline from OSM to actionable bridge importance rankings, (2) a five-indicator scoring methodology with 40$\times$ computational optimization, (3) a UMAP+HDBSCAN clustering framework validated on multi-city data, (4) an LLM interpretation methodology including temperature optimization and model selection rationale, and (5) transferability demonstration across cities via configuration-only adaptation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes