CVSep 6, 2025

JRN-Geo: A Joint Perception Network based on RGB and Normal images for Cross-view Geo-localization

arXiv:2509.05696v12 citationsh-index: 4ICRA
Originality Incremental advance
AI Analysis

This work addresses the problem of UAV localization and navigation by improving robustness to viewpoint variations, though it is incremental as it builds on existing methods by adding geometric features.

The paper tackles cross-view geo-localization by integrating geometric structural information from normal images with RGB images using a joint perception network, achieving state-of-the-art performance on datasets like University-1652 and SUES-200.

Cross-view geo-localization plays a critical role in Unmanned Aerial Vehicle (UAV) localization and navigation. However, significant challenges arise from the drastic viewpoint differences and appearance variations between images. Existing methods predominantly rely on semantic features from RGB images, often neglecting the importance of spatial structural information in capturing viewpoint-invariant features. To address this issue, we incorporate geometric structural information from normal images and introduce a Joint perception network to integrate RGB and Normal images (JRN-Geo). Our approach utilizes a dual-branch feature extraction framework, leveraging a Difference-Aware Fusion Module (DAFM) and Joint-Constrained Interaction Aggregation (JCIA) strategy to enable deep fusion and joint-constrained semantic and structural information representation. Furthermore, we propose a 3D geographic augmentation technique to generate potential viewpoint variation samples, enhancing the network's ability to learn viewpoint-invariant features. Extensive experiments on the University-1652 and SUES-200 datasets validate the robustness of our method against complex viewpoint ariations, achieving state-of-the-art performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes