AICY · Aug 30, 2025

SIGMUS: Semantic Integration for Knowledge Graphs in Multimodal Urban Spaces

arXiv:2509.00287v1
h-index: 2
Originality: Synthesis-oriented
AI Analysis

This work addresses the challenge of automated incident analysis in urban spaces for city planners and emergency responders, though it appears incremental because it applies existing LLM technology to a specific domain.

The paper tackles the problem of integrating fragmented multimodal urban data to identify and reason about incidents such as emergencies or events. It develops SIGMUS, a system that uses Large Language Models to automatically generate relationships between data sources and incidents, and it reports reasonable connections across five data types without human-encoded rules.

Modern urban spaces are equipped with an increasingly diverse set of sensors, all producing an abundance of multimodal data. Such multimodal data can be used to identify and reason about important incidents occurring in urban landscapes, such as major emergencies, cultural and social events, and natural disasters. However, such data may be fragmented over several sources and difficult to integrate, because identifying the relationships between the multimodal data corresponding to an incident, and understanding the different components that define an incident, both rely on human-driven reasoning. Such relationships and components are critical to identifying the causes of such incidents, as well as forecasting the scale and intensity of future incidents as they begin to develop. In this work, we create SIGMUS, a system for Semantic Integration for Knowledge Graphs in Multimodal Urban Spaces. SIGMUS uses Large Language Models (LLMs) to produce the necessary world knowledge for identifying relationships between incidents occurring in urban spaces and data from different modalities, allowing us to organize evidence and observations relevant to an incident without relying on human-encoded rules for relating multimodal sensory data with incidents. This organized knowledge is represented as a knowledge graph, organizing incidents, observations, and much more. We find that our system is able to produce reasonable connections between 5 different data sources (news article text, CCTV images, air quality, weather, and traffic measurements) and relevant incidents occurring at the same time and location.
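
The abstract describes linking observations from several modalities to incidents that occur at the same time and location, and storing the result as a knowledge graph. The sketch below illustrates that general idea only; it is not the authors' implementation. Everything in it is an assumption made for illustration: the `Incident` and `Observation` records, the `evidence_for` edge label, the distance and time thresholds, and the `is_relevant` callable, which stands in for an LLM relevance judgment and is replaced here by a trivial keyword stub.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from math import asin, cos, radians, sin, sqrt
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Incident:
    # Hypothetical record; field names are not taken from the paper.
    id: str
    description: str
    time: datetime
    lat: float
    lon: float


@dataclass(frozen=True)
class Observation:
    # Hypothetical record for one piece of multimodal data.
    id: str
    modality: str  # e.g. "news", "cctv", "air_quality", "weather", "traffic"
    summary: str
    time: datetime
    lat: float
    lon: float


Triple = Tuple[str, str, str]  # (subject, predicate, object) graph edge


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometres between two lat/lon points."""
    dlat = radians(lat2 - lat1)
    dlon = radians(lon2 - lon1)
    a = sin(dlat / 2) ** 2 + cos(radians(lat1)) * cos(radians(lat2)) * sin(dlon / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(a))


def keyword_stub(incident: Incident, obs: Observation) -> bool:
    """Placeholder for an LLM relevance judgment (assumption, not the paper's method)."""
    words = incident.description.lower().split()
    return any(word in obs.summary.lower() for word in words)


def link_observations(
    incidents: List[Incident],
    observations: List[Observation],
    is_relevant: Callable[[Incident, Observation], bool] = keyword_stub,
    max_km: float = 2.0,                        # assumed spatial threshold
    max_gap: timedelta = timedelta(hours=1),    # assumed temporal threshold
) -> List[Triple]:
    """Emit (observation, 'evidence_for', incident) edges for co-located, co-timed pairs."""
    triples: List[Triple] = []
    for inc in incidents:
        for obs in observations:
            close_in_time = abs(obs.time - inc.time) <= max_gap
            close_in_space = haversine_km(inc.lat, inc.lon, obs.lat, obs.lon) <= max_km
            if close_in_time and close_in_space and is_relevant(inc, obs):
                triples.append((obs.id, "evidence_for", inc.id))
    return triples
```

In a real system the `is_relevant` callable would wrap a model call, and the resulting triples would be loaded into a graph store; both are left out here because the abstract does not specify them.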
