CV · Oct 18, 2022

SA-DNet: A on-demand semantic object registration network adapting to non-rigid deformation

Housheng Xie, Junhui Qiu, Yuan Dai, Yang Yang, Changcheng Xiang, Yukuan Zhang

arXiv:2210.09900v21.42 citations · h-index: 8

Originality: Incremental advance

AI Analysis

This paper addresses non-rigid distortions in multi-modal image registration for applications such as image fusion. It appears incremental, building on existing methods with semantic and hybrid matching enhancements.

The paper tackles non-rigid registration of infrared and visible images, which is crucial for accurate fusion. It proposes SA-DNet, a semantic-aware network that confines feature matching to regions of interest. In experiments against five state-of-the-art methods, SA-DNet adapts better to distortions and produces semantically well-registered images.

As an essential processing step before fusing infrared and visible images, image registration determines whether the two images can be fused at the correct spatial positions. In practice, differing imaging devices may introduce a change in perspective or a time gap between shots, producing a significant non-rigid spatial relationship between the infrared and visible images. Even if a large number of feature points are matched, the registration accuracy may still be inadequate, affecting the result of image fusion and other vision tasks. To alleviate this problem, we propose a Semantic-Aware on-Demand registration network (SA-DNet), whose main purpose is to confine the feature matching process to the semantic region of interest (sROI) by means of a semantic-aware module (SAM) and a HOL-Deep hybrid matching module (HDM). After a thin-plate spline (TPS) transformation aligns the infrared and visible images based on the corresponding feature points in the sROI, the registered images are fused by an image fusion module (IFM), yielding a fully functional registration and fusion network. Moreover, we point out that this type of approach allows semantic objects to be selected for feature matching as needed, so that registration can be tailored to task-specific requirements. To demonstrate the robustness of SA-DNet to non-rigid distortions, we conduct extensive experiments comparing SA-DNet with five state-of-the-art infrared and visible image feature matching methods. The experimental results show that our method adapts better to non-rigid distortions in the images and provides semantically well-registered images.
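The idea of confining matches to the sROI and then fitting a TPS warp can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the learned SAM and HDM modules are not reproduced, and the sketch assumes a binary sROI mask and a set of already matched point pairs are given. The function names `filter_matches_to_sroi`, `fit_tps`, and `apply_tps` are hypothetical, and the spline is the standard 2D thin-plate spline with kernel U(r) = r² log r.

```python
import numpy as np

def filter_matches_to_sroi(src_pts, dst_pts, mask):
    """Keep only matches whose source point (x, y) lies inside the binary sROI mask."""
    cols = np.round(src_pts[:, 0]).astype(int)
    rows = np.round(src_pts[:, 1]).astype(int)
    h, w = mask.shape
    in_bounds = (rows >= 0) & (rows < h) & (cols >= 0) & (cols < w)
    keep = np.zeros(len(src_pts), dtype=bool)
    keep[in_bounds] = mask[rows[in_bounds], cols[in_bounds]] > 0
    return src_pts[keep], dst_pts[keep]

def _tps_kernel(r2):
    # U(r) = r^2 log(r) = 0.5 * r^2 * log(r^2), with U(0) defined as 0.
    safe = np.where(r2 > 0, r2, 1.0)
    return np.where(r2 > 0, 0.5 * r2 * np.log(safe), 0.0)

def fit_tps(src_pts, dst_pts):
    """Solve the TPS linear system; needs at least 3 non-collinear, distinct points."""
    n = len(src_pts)
    d2 = np.sum((src_pts[:, None, :] - src_pts[None, :, :]) ** 2, axis=-1)
    P = np.hstack([np.ones((n, 1)), src_pts])  # affine part: [1, x, y]
    A = np.zeros((n + 3, n + 3))
    A[:n, :n] = _tps_kernel(d2)
    A[:n, n:] = P
    A[n:, :n] = P.T
    b = np.zeros((n + 3, 2))
    b[:n] = dst_pts
    return np.linalg.solve(A, b)  # rows 0..n-1: kernel weights, last 3 rows: affine

def apply_tps(params, src_pts, query_pts):
    """Map query points through the fitted spline."""
    n = len(src_pts)
    d2 = np.sum((query_pts[:, None, :] - src_pts[None, :, :]) ** 2, axis=-1)
    P = np.hstack([np.ones((len(query_pts), 1)), query_pts])
    return _tps_kernel(d2) @ params[:n] + P @ params[n:]
```

Selecting a different semantic object amounts to passing a different mask, which is the on-demand aspect the abstract describes. Warping a full image would additionally require evaluating the spline on a pixel grid and resampling, which this sketch leaves out.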
