IRCLHCJun 16, 2025

SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists

arXiv:2506.13188v11 citationsh-index: 3Has Code
Originality Incremental advance
AI Analysis

This provides a practical tool for investigative journalists to lower technical barriers in geolocation verification, though it is incremental as it builds on existing LLM methods for a specific domain.

The paper tackles the problem of complex query languages for accessing OpenStreetMap data by introducing SPOT, a natural language interface that enables non-technical users, such as investigative journalists, to perform geolocation verification with reliable accuracy.

OpenStreetMap (OSM) is a vital resource for investigative journalists doing geolocation verification. However, existing tools to query OSM data such as Overpass Turbo require familiarity with complex query languages, creating barriers for non-technical users. We present SPOT, an open source natural language interface that makes OSM's rich, tag-based geographic data more accessible through intuitive scene descriptions. SPOT interprets user inputs as structured representations of geospatial object configurations using fine-tuned Large Language Models (LLMs), with results being displayed in an interactive map interface. While more general geospatial search tasks are conceivable, SPOT is specifically designed for use in investigative journalism, addressing real-world challenges such as hallucinations in model output, inconsistencies in OSM tagging, and the noisy nature of user input. It combines a novel synthetic data pipeline with a semantic bundling system to enable robust, accurate query generation. To our knowledge, SPOT is the first system to achieve reliable natural language access to OSM data at this level of accuracy. By lowering the technical barrier to geolocation verification, SPOT contributes a practical tool to the broader efforts to support fact-checking and combat disinformation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes