CVMay 21, 2025

Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation

arXiv:2505.15491v13 citationsh-index: 9
Originality Incremental advance
AI Analysis

This work addresses the problem of robust semantic segmentation in challenging conditions like low illumination for applications such as autonomous driving, representing an incremental improvement.

The paper tackles the challenge of effectively fusing RGB and thermal features for semantic segmentation by proposing a spectral-aware approach, achieving state-of-the-art performance on MFNet and PST900 datasets.

Semantic segmentation relying solely on RGB data often struggles in challenging conditions such as low illumination and obscured views, limiting its reliability in critical applications like autonomous driving. To address this, integrating additional thermal radiation data with RGB images demonstrates enhanced performance and robustness. However, how to effectively reconcile the modality discrepancies and fuse the RGB and thermal features remains a well-known challenge. In this work, we address this challenge from a novel spectral perspective. We observe that the multi-modal features can be categorized into two spectral components: low-frequency features that provide broad scene context, including color variations and smooth areas, and high-frequency features that capture modality-specific details such as edges and textures. Inspired by this, we propose the Spectral-aware Global Fusion Network (SGFNet) to effectively enhance and fuse the multi-modal features by explicitly modeling the interactions between the high-frequency, modality-specific features. Our experimental results demonstrate that SGFNet outperforms the state-of-the-art methods on the MFNet and PST900 datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes