CVAINov 6, 2024

Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions

arXiv:2411.03576v11 citationsh-index: 27IEEE Robot Autom Lett
Originality Incremental advance
AI Analysis

This addresses a critical issue for autonomous driving systems by improving robustness against sensor misalignment and failures, though it is incremental as it builds on existing fusion methods.

The paper tackles the problem of pedestrian detection in real-world conditions where RGB-Thermal image pairs may have partial overlap or sensor failure, proposing a Hybrid Attention mechanism that outperforms state-of-the-art methods in simulated scenarios.

Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adversarial illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existing fusion methods rely on the critical assumption that the RGB-Thermal (RGB-T) image pairs are fully overlapping. These assumptions often do not hold in real-world applications, where only partial overlap between images can occur due to sensors configuration. Moreover, sensor failure can cause loss of information in one modality. In this paper, we propose a novel module called the Hybrid Attention (HA) mechanism as our main contribution to mitigate performance degradation caused by partial overlap and sensor failure, i.e. when at least part of the scene is acquired by only one sensor. We propose an improved RGB-T fusion algorithm, robust against partial overlap and sensor failure encountered during inference in real-world applications. We also leverage a mobile-friendly backbone to cope with resource constraints in embedded systems. We conducted experiments by simulating various partial overlap and sensor failure scenarios to evaluate the performance of our proposed method. The results demonstrate that our approach outperforms state-of-the-art methods, showcasing its superiority in handling real-world challenges.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes