LGDLHCIRApr 27

CiteRadar: A Citation Intelligence Platform for Researcher Profiling and Geographic Visualization

NVIDIA
arXiv:2604.2505780.8h-index: 4Has Code
Predicted impact top 15% in LG · last 90 daysOriginality Incremental advance
AI Analysis

For individual researchers and small institutions lacking expensive bibliometric subscriptions, CiteRadar provides a free, automated way to profile citation geography and community structure.

CiteRadar is an open-source tool that, given a Google Scholar ID, produces a researcher's publication list, citing papers with metadata, ranked author tables, a statistical summary, and an interactive world map. It addresses the lack of accessible tools for geographic and community analysis of citations, with technical fixes that improve author disambiguation and location data coverage.

Understanding the geographic reach and community structure of one's scholarly citations is increasingly valuable for career development, grant applications, and collaboration discovery -- yet accessible tools for answering these questions remain scarce. Existing bibliometric platforms either require costly institutional subscriptions or expose only aggregate citation counts without granular per-author metadata. We present CiteRadar, an open-source system that accepts a single Google Scholar user identifier and automatically produces a structured output folder containing: the author's complete publication list, all retrieved citing papers with enriched author metadata, two ranked author tables (by citation frequency and by h-index), a plain-text statistical summary, and a self-contained interactive HTML world map -- all from a single command-line invocation. CiteRadar integrates five heterogeneous data sources -- Google Scholar, OpenAlex, CrossRef, Semantic Scholar, and OpenStreetMap Nominatim -- through a carefully engineered five-stage pipeline. Key technical contributions include: (1) a Scholar meta-string parser resilient to Unicode non-breaking-space separators, a pervasive but undocumented quirk in Scholar's HTML that silently corrupts venue and year fields when unhandled; (2) a two-stage author disambiguation system using stop-word-filtered institution name similarity to guard against the well-known same-name entity-merging failure mode in bibliometric databases, demonstrated to eliminate h-index attribution errors of up to 9x the correct value; (3) an OpenAlex web-URL to API-URL conversion fix that raises the fraction of author records with city-level location data from 0% to ~60%; and (4) a logarithmically-scaled interactive Folium world map with per-city researcher popups, rendered as a fully self-contained HTML file.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes