CLJan 20, 2025

RACCOON: A Retrieval-Augmented Generation Approach for Location Coordinate Capture from News Articles

arXiv:2501.11440v15 citationsh-index: 3Has CodeWWW
Originality Incremental advance
AI Analysis

This addresses geocoding for news articles, which is useful for tasks such as epidemic intelligence or disaster management, but it appears incremental as it applies an existing RAG method to a specific domain.

The paper tackles the problem of automatically extracting location coordinates from news articles for applications like epidemic intelligence, introducing RACCOON, a retrieval-augmented generation approach that uses LLMs to generate coordinates, with evaluation on three datasets and two LLMs showing its utility.

Geocoding involves automatic extraction of location coordinates of incidents reported in news articles, and can be used for epidemic intelligence or disaster management. This paper introduces Retrieval-Augmented Coordinate Capture Of Online News articles (RACCOON), an open-source geocoding approach that extracts geolocations from news articles. RACCOON uses a retrieval-augmented generation (RAG) approach where candidate locations and associated information are retrieved in the form of context from a location database, and a prompt containing the retrieved context, location mentions and news articles is fed to an LLM to generate the location coordinates. Our evaluation on three datasets, two underlying LLMs, three baselines and several ablation tests based on the components of RACCOON demonstrate the utility of RACCOON. To the best of our knowledge, RACCOON is the first RAG-based approach for geocoding using pre-trained LLMs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes