CVDec 14, 2022

Multi-Modal Domain Fusion for Multi-modal Aerial View Object Classification

arXiv:2212.07039v11 citationsh-index: 6
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving automatic target recognition for aerial surveillance in varied conditions, representing an incremental advance in multi-modal fusion methods.

The paper tackles the problem of low accuracy in aerial view object classification by fusing electro-optical and synthetic aperture radar sensor data to overcome their individual limitations, resulting in top-10 and top-5 performances with accuracies of 25.3% and 34.26% on a benchmark dataset.

Object detection and classification using aerial images is a challenging task as the information regarding targets are not abundant. Synthetic Aperture Radar(SAR) images can be used for Automatic Target Recognition(ATR) systems as it can operate in all-weather conditions and in low light settings. But, SAR images contain salt and pepper noise(speckle noise) that cause hindrance for the deep learning models to extract meaningful features. Using just aerial view Electro-optical(EO) images for ATR systems may also not result in high accuracy as these images are of low resolution and also do not provide ample information in extreme weather conditions. Therefore, information from multiple sensors can be used to enhance the performance of Automatic Target Recognition(ATR) systems. In this paper, we explore a methodology to use both EO and SAR sensor information to effectively improve the performance of the ATR systems by handling the shortcomings of each of the sensors. A novel Multi-Modal Domain Fusion(MDF) network is proposed to learn the domain invariant features from multi-modal data and use it to accurately classify the aerial view objects. The proposed MDF network achieves top-10 performance in the Track-1 with an accuracy of 25.3 % and top-5 performance in Track-2 with an accuracy of 34.26 % in the test phase on the PBVS MAVOC Challenge dataset [18].

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes