CVMay 17, 2025

Image-based Visibility Analysis Replacing Line-of-Sight Simulation: An Urban Landmark Perspective

arXiv:2505.11809v1h-index: 6
Originality Incremental advance
AI Analysis

This addresses the need for more perceptual and contextual visibility analysis in urban planning and related fields, though it is incremental as it builds upon existing image-based and graph methods.

The study tackled the problem of assessing urban landmark visibility by replacing traditional Line-of-Sight simulations with an image-based method using Vision Language Models on Street View Images, achieving an overall accuracy of 87% in detecting landmarks and revealing contextual differences and connection patterns, such as bridges accounting for 30% of connections along the River Thames.

Visibility analysis is one of the fundamental analytics methods in urban planning and landscape research, traditionally conducted through computational simulations based on the Line-of-Sight (LoS) principle. However, when assessing the visibility of named urban objects such as landmarks, geometric intersection alone fails to capture the contextual and perceptual dimensions of visibility as experienced in the real world. The study challenges the traditional LoS-based approaches by introducing a new, image-based visibility analysis method. Specifically, a Vision Language Model (VLM) is applied to detect the target object within a direction-zoomed Street View Image (SVI). Successful detection represents the object's visibility at the corresponding SVI location. Further, a heterogeneous visibility graph is constructed to address the complex interaction between observers and target objects. In the first case study, the method proves its reliability in detecting the visibility of six tall landmark constructions in global cities, with an overall accuracy of 87%. Furthermore, it reveals broader contextual differences when the landmarks are perceived and experienced. In the second case, the proposed visibility graph uncovers the form and strength of connections for multiple landmarks along the River Thames in London, as well as the places where these connections occur. Notably, bridges on the River Thames account for approximately 30% of total connections. Our method complements and enhances traditional LoS-based visibility analysis, and showcases the possibility of revealing the prevalent connection of any visual objects in the urban environment. It opens up new research perspectives for urban planning, heritage conservation, and computational social science.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes